Plants having increased tolerance to heat stress

ABSTRACT

The invention relates to methods of producing a desired phenotype in a plant by manipulation of gene expression within the plant. The method relates to means which increase the level of expression of a MYB-subgroup14 polynucleotide or a MYB68 polypeptide. The method also relates to expression of a nucleic acid sequence encoding a MYB-subgroup14 or a MYB68 transcriptional factor. The methods are directed to elevating the levels of a MYB-subgroup14 or a MYB68 expression, wherein a desired phenotype such as reduced flower abortion and increased yield during heat stress is observed. The invention also relates to nucleic acid sequences useful in such methods.

REFERENCE TO RELATED APPLICATIONS

This non-provisional patent application is a continuation of pending U.S. patent application Ser. No. 12/148,548, filed Apr. 18, 2008, which claims priority under 35 U.S.C. § 119(e) to U.S. Ser. No. 60/925,312, filed Apr. 18, 2007, and U.S. Ser. No. 60/965,582, filed Aug. 20, 2007, the contents of each of which are herein incorporated by reference in their entireties.

INCORPORATION BY REFERENCE OF SEQUENCE LISTING

The contents of the text file named “PREP-014C01US Sequence Listing.txt”, which was created on Jan. 26, 2017 and is 439 KB in size, are hereby incorporated by reference in their entireties.

FIELD OF THE INVENTION

The invention is in the field of plant molecular biology and relates to transgenic plants having novel phenotypes, methods of producing such plants and polynucleotides and polypeptides useful in such methods. More specifically, the invention relates to the use of MYB polynucleotides and transgenic plants expressing these polynucleotides and polypeptides.

BACKGROUND OF THE INVENTION

Environmental stresses are responsible for significant yield reduction in agricultural crops. In addition to many reports published previously, the relation between climate variation and production of corn and soybean throughout the United States for the period 1982-1998 was studied in recent years (Lobell and Asner, 2003). Gradual temperature changes have made a measurable impact on crop yield. In corn and soybean it has been estimated that yield is reduced by 17% per degree as the growth temperature rises above the season optimum. With a predicted temperature increase of 1.4° C. to 5.8° C. between the years 1990 and 2010 (IPCC Working Group I, 2001), improvement of high temperature tolerance in crop plants has become one of the major focuses of agricultural biotechnology development.

Both monocots and dicots are particularly sensitive to heat stress during flowering and seed development and therefore heat stress has a significant impact on seed yield (Young et al., 2004; Sato et al., 2002; Angadi et al., 2000; Carlson, 1990; Wahid, A., Gelani, S., Ashraf, M., and Foolad, M. R. (2007)). It has been suggested that plants possess an inherent ability for basal and acquired thermotolerance and that a common heat response mechanisms is present in diverse plant species (Kapoor et al., 1990; Vierling, 1991; Flahaut et al., 1996; Burke et al., 2000; Hong and Vieling, 2000; Massie et al., 2003; Larkindale et al., 2005). Basal thermotolerance allows plants survive from exposure to temperature above optimal for growth, whereas acquired thermotolerance is induced by a short acclimation period at a sub-lethal heat stress which enables a plant to survive a subsequent heat stress that would be otherwise lethal. A number of studies have been conducted to identify and characterize genes and pathways that are involved in plant thermotolerance. For example, heat shock transcription factors (HSF) and heat shock proteins (HSP) have received much attention to elucidate the roles and effects of these genes in response to heat stress as have plant growth hormones such as abscisic acid and ethylene.

Transcription factors are DNA binding proteins that interact with specific promoter or enhancer sequences and alter the gene expression of the associated gene. Where the specific sequence that binds the transcription factor is associated with a suite of genes whole pathways can be coordinately regulated with various component genes being simultaneously up-regulated or down-regulated. A transcription factors may coordinately alter a suite of genes in response to a stimulus such as an environmental stress, nutritional status or pathogen attack, for example, or can be a component of a signaling pathway, such as a hormone signaling pathway for example. Transcription factors posses a modular structure and are classified primarily on the basis of the DNA binding domain.

The MYB family of transcription factors is composed of at least 198 genes (Yanhui et al. 2006) and has been proposed to have regulatory functions in a wide array of processes ranging from growth and development to defense responses. Plant MYB proteins are classified based on the presence and number of imperfect MYB repeats each composed of about 52 amino acids. The MYB domain forms a helix-turn-helix conformation and represents the DNA binding domain. Three major groups of MYB proteins have been classified as R1R2R3-MYB, R2R3-MYB and MYB-related proteins.

The R2R3-MYB family of proteins in Arabidopsis consists of 125 proteins and is characterized by having a R2R3 DNA binding domain at their N-terminus (Kranz et al., 1998, and Stracke et al., 2001). These genes are involved in a number of biological processes including mediating hormone actions, secondary metabolism (Paz-Ares et al., 1987), control of cell morphogenesis (Oppenheimer et al., 1991), meristem, floral and seed development (Kirik et al., 1998, Schmitz et al., 2002) and response to various environmental factors (Kranz et al., 1998; Jin and Martin, 1999; Meissner et al, 1999).

MYB sequences have been further classified into a number of subgroups based on sequence (Krantz et al 1998, Stracke et al 2001). MYB68 falls within subgroup14 as does MYB36 and MYB84 as identified by Krantz et al 1998. However, Stracke et al 2001, have additionally include the MYB37, MYB38 and MYB87 in subgroup14. Stracke further notes that there are several cases of functional conservation of genes that cluster together in the dendrogram.

Classification of the R2R3-MYB family has identified 125 MYB proteins in Arabidopsis thaliana (At). A R2R3 MYB gene is characterized by a MYB domain containing two imperfect repeats of 53 aa (R2, and R3). Each repeat contains three helix-turn-helix structures. The R2 and R3 domains are located near the N-terminus of the proteins. The last two helices on each repeat with a loop between them form a DNA-binding motif structure similar to HLH proteins. The third helix directly binds to DNA, and the first and second helices contribute to the conformation of the HLH motif that appears to be important in recognition of a specific gene target (Ogata et al., 1994; William and Grotewold, 1997; Jia et al., 2004). The R2R3-MYB proteins were further characterized into 22 subgroups according to their phylogenetic relationship based on at least one of the shared amino acid motifs in addition to the MYB domain (Kranz et al., 1998). AtMYB68, AtMYB84, and AtMYB36 were categorized as subgroup14 based on two shared motifs: S1: SFSQLLLDPN SEQ ID NO:266 and S2: TSTSADQSTISWEDI SEQ ID NO:267, at the C-terminus of the proteins. The homology at these motifs was limited, for example, Arabidopsis MYB36 has only 20% identity. Subsequently, AtMYB87, AtMYB37 and AtMYB38 were also included in subgroup14, on the basis of sequence conservation in the MYB DNA domain: R2 and R3 helix-turn-helix repeats (Stracke et al., 2001).

The R2R3 domains may be indicative of specific DNA binding through the unique amino acid sequence of the third helix of the R3 domain and minor conformational changes associated with the structural interaction between the first two helices. It suggests that subgroup14 members may be functionally redundant orthologous. For example, lateral meristem initiation in Arabidopsis was studied with respect to MYB-subgroup14 (Muller et al., 2006). All members of MYB-subgroup14 showed high similarity to the tomato Blind (B1) gene, a regulator of axillary meristems. Transcripts of four members: AtMYB37, AtMYB38, AtMYB84 and AtMYB87 were detected by RT-PCR in tissues including shoot tip, internode, leaf, flower bud, open flower, and root, whereas AtMYB36 and AtMYB68 expression was expressed in root tissue. Phenotypic analysis using knockouts of AtMYB37, AtMYB38 and AtMYB84 indicated that these members of MYB-subgroup14 at least partially redundant for regulating axillary bud formation.

MYB68 is a R2R3 type MYB gene, and a member of MYB-subgroup14, that has been identified in a transposon gene trapping study (Feng et al., 2004). Expression of this gene has been demonstrated to be specific to root pericycle cells. In the null mutant, no MYB68 mRNA was detectable; however, no mutant phenotype was exhibited when plants were grown under standard conditions. In the evaluation of MYB68 under a variety of growth conditions the only phenotype discerned was reduced plant leaf area when plants were grown under hot greenhouse conditions (30-40° C.). This phenotype was rescued by transformation of the myb68 mutant background with a wild-type MYB68 gene. Examination root tissue of the myb68 mutant grown in root cultures indicated increased biomass and lignin levels. The authors conclude that MYB68 is involved in root development (Feng et al., 2004).

Transcriptional activation is primarily mediated through transcription factors that interact with enhancer and promoter elements. Binding of transcription factors to such DNA elements constitutes a crucial step in transcriptional initiation. Each transcription factor binds to its specific binding sequence in a promoter and activates expression of the linked coding region through interactions with coactivators and/or proteins that are a part of the transcription complex.

SUMMARY OF THE INVENTION

This invention relates to a method for enhancing the heat stress tolerance of plants by means of increasing the expression of a MYB subgroup-14 polypeptide. Enhanced heat stress tolerance includes improved seed set during and following conditions of heat stress. Improved seed set results in increased yield. A MYB-subgroup-14 polypeptide includes for example a MYB68, a MYB36, a MYB84, a MYB37, a MYB38 or a MYB87 polypeptide. Preferably, the MYB-subgroup-14 polypeptide is a MYB68, a MYB36 or a MYB84 polypeptide. The MYB subgroup-14 polypeptide expression is ectopic, or constitutive. Alternatively, expression of the MYB subgroup-14 polypeptide in its typical place of expression, e.g. root tissue.

A heat stress heat stress tolerant plant is produced by providing a nucleic acid construct that increases the expression of a Myb subgroup-14 polypeptide, inserting the nucleic construct into a vector, transforming a plant, tissue culture, or a plant cell with the vector to obtain a plant, tissue culture or a plant cell with increased expression of the Myb subgroup-14 polypeptide and growing said plant or regenerating a plant from the tissue culture or plant cell. A nucleic acid construct that increases the expression of a Myb subgroup-14 polypeptide includes for example an enhancer element. An enhancer is a sequence found in eukaryotes and certain eukaryotic viruses which can increase transcription of a gene when located, in either orientation, up to several kilobases from the gene concerned. These sequences act as enhancers when on the 5′ side (upstream) of the gene in question. However, some enhancers are active when placed on the 3′ side (downstream) of the gene. The enhancer elements can activate transcription of a gene and alter the normal expression pattern of the endogenous gene. Enhancer elements are known to those skilled in the art. For example the enhancer element is a 35S enhancer element.

Additionally, a nucleic acid construct that increases the expression of a Myb subgroup-14 polypeptide includes for example a nucleic acid encoding a Myb subgroup-14 polypeptide. Exemplary, MYB polypeptides and nucleic acids include those of SEQ ID NO: 1-265. The nucleic acid encoding a Myb subgroup-14 polypeptide is operably linked to a promoter. The promoter is a heterologous promoter or a homologous promoter. Additionally, the promoter is a constitutive or an inducible promoter.

By increasing the expression of a MYB subgroup-14 polypeptide is meant that the amount produced by the cell transformed with the nucleic acid construct is greater than a cell, e.g. control cell that is not transformed with the nucleic acid construct. A control cell includes for example a cell that endogenously expresses a MYB subgroup-14 polypeptide such a plant root cell, alternatively a control cell is a non transformed cell of the same cell-type as the transformed cell, be it a leaf cell a meristem cell or a flower or seed cell. An increase is a 1-fold, 2-fold, 3 fold or greater increase. An increase of expression is also meant to include expression of a MYB subgroup-14 polypeptide in a cell that does not typically produced by a cell.

Also included in the invention is a method of identifying a heat stress tolerant plant. The plants identified by these methods have reduced flower abortion and increased yield as compared to a control plant. Heat stress tolerant plants are identified by exposing a population of flowering plants to a heat stress treatment and selecting a plant from the population of plants that has reduced flower abortion. Heat stress treatment includes for example exposing the plant to a temperature that is hot enough for a sufficient amount of time such that damage to plant functions or development results. By reduced flower abortion is meant that a plant does not loss as many flowers, due to flower abortion, or has a greater seed yield compared to another plant that is exposed to a similar level of heat stress. Plants with a reduced flower abortion have a 5, 10, 20, 25, 30% or more increase in seed yield as compared to a control plant.

The invention further includes the plants produced by the methods of the invention and the seed produced by the plants which produce a plant that has an increase tolerance to heat stress.

Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. Although methods and materials similar or equivalent to those described herein can be used in the practice or testing of the present invention, suitable methods and materials are described below. All publications, patent applications, patents, and other references mentioned herein are incorporated by reference in their entirety. In case of conflict, the present specification, including definitions, will control. In addition, the materials, methods, and examples are illustrative only and not intended to be limiting.

Other features and advantages of the invention will be apparent from the following detailed description, and from the claims.

DETAILED DESCRIPTION

The invention is based upon the surprising discovery of plants that have an increased tolerance to heat stress which results in an increased yield relative to a wild-type control. More specifically, the invention is based upon the discovery that increasing the expression of a MYB-subgroup14 polypeptide (e.g., MYB68) results in a plant having an increased resistance to heat stress.

Expression of a MYB-subgroup 14 polypeptide can be accomplished for example by increasing the expression of an endogenous MYB-subgroup 14 polypeptide (e.g., activation tag insertion) or by expression of an exogenous gene construct encoding for a MYB-subgroup 14 polypeptide. The gene encoding for the MYB-subgroup 14 polypeptide may be endogenous or exogenous to the transformed species. As shown in the EXAMPLES plants having an increases resistance to heat stress were produced not only transforming a plant with its native MYB-subgroup 14 polypeptide but also with a MYB-subgroup 14 polypeptide from another plant species.

Accordingly the invention provides methods of enhancing (e.g, increasing) the heat stress tolerance of plants by increasing the expression of a MYB subgroup-14 polypeptide. Also included in the invention is a method of identifying a heat stress tolerant plant. The plants identified by these methods have reduced flower abortion and increased yield as compared to a control plant. Heat stress tolerant plants are identified by exposing the population of flowering plants to a heat stress treatment and selecting a plant from the population of plants that has reduced flower abortion. The invention also includes the transgenic plants produced by the methods of the invention and the seeds produced by the transgenic plants that produce a heat stress tolerant plant.

Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. Although methods and materials similar or equivalent to those described herein can be used in the practice or testing of the present invention, suitable methods and materials are described below. All publications, patent applications, patents, and other references mentioned herein are incorporated by reference in their entirety. In the case of conflict, the present specification, including definitions, will control. In addition, the materials, methods, and examples are illustrative only and not intended to be limiting.

For convenience, before further description of the present invention, certain terms employed in the specification, examples and appended claims are defined herein. These definitions should be read in light of the remainder of the disclosure and as understood by a person of ordinary skill in the art.

The term “constitutive expression” means expression of a gene in any cell at constant levels in a non-regulated manner.

The terms “cMYB” and “MYB” refer to a cDNA clone of MYB and are used interchangeably. Where a genomic sequence has been used or referred to, it is identified and differentiated by the term “gMYB” or “genomic MYB” thereby referring to a genomic MYB sequence.

The term “ectopic expression” means expression of a gene in an abnormal place in an organism relative to the endogenous gene expression. Ectopic expression may include constitutively expressed genes depending on the native expression patterns of a given gene.

The term “expression cassette” means a vector construct wherein a gene is transcribed. Additionally, the expressed mRNA may be translated into a polypeptide.

The terms “expression” or “over-expression” are used interchangeably and means the expression of a gene such that the transgene is expressed. The total level of expression in a cell may be elevated relative to a wild-type cell.

“Flower abortion” means a flower that fails to develop and produce a fruit or seed. In addition to premature senescence of a flower, flower abortion may refer to loss of pollen production, altered pollination or fertilization and subsequent seed development. Altered growth and development of meristem tissue, a flower meristem in particular, is further included within the meaning of flower abortion.

The term “heat tolerance” is defined as a phenotype where a first plant, or plant line, has increased capacity to withstand elevated temperature and produce a yield that is in excess of a second plant or plant line, the second plant line being a control plant such as a wild-type control plant line.

A “promoter sequence”, or “promoter”, means a nucleic acid sequence capable of inducing transcription of an operably linked gene sequence in a plant cell.

The term “seed set” is seed formation as a result of flower pollination followed by egg cell fertilization and zygote development. Reductions in seed set which can occur due to interruption in any of the above processes will produce a net reduction in seed number produced.

The term “substantially similar” refers to nucleic acids where a change in one or more nucleotides does not alter the functional properties of the nucleic acid or the encoded polypeptide. Due to the degeneracy of the genetic code, a base pair change can result in no change in the encoded amino acid sequence. For example, the codons ACT, ACC, ACA and ACG all encode a threonine amino acid. Alternatively one or more base pair changes may alter the encoded amino acid however if the substituted amino acid has similar chemical properties functionality of the encoided protein is likely to be unaffected. For example, threonine codons ACT and ACC when changed to AGT or AGC respectively encode for serine, a chemically and biologically similar amino acid. Additionally, certain amino acids within a polypeptide are non essential and alterations may be made in these locations without an effect on the functionality of the polypeptide. Substantially similar also refers to sequences having changes at one or more nucleotide bases wherein the changes do not affect the ability of the sequence to alter gene expression by various gene silencing methodologies such as antisense, RNAi or co-suppression. The term “substantially similar” refers to polypeptides wherein a change in one or more amino acids does not alter the functional properties of the polypeptide as discussed above.

The term “yield” refers to seed number, seed weight, seed size, total plant biomass, increased biomass of a plant organ, such as stems or leaves or roots, fruit production, and flower production,

The term “yield protection” is defined as the positive difference, expressed as a % value, between the yield of the transgenic or mutant and the control, where the yield is expressed as a % of optimal, following an imposed stress. The calculation is done by comparing the optimal yield with that after the stress treatment (stress yield/optimal yield×100).

The MYB gene family is classified based on sequence homology and the presence of defined domains and motifs such as an R2R3 domain. The classification in all cases is not absolute and varies depending on the criteria selected for the analysis (Krantz et al 1998, Stracke et al 2001).

Herein we define the MYB-subgroup14 to include at least the following members, MYB68, MYB36, MYB84, MYB37, MYB38 and MYB87. The Arabidopsis MYB68, MYB36, MYB84, MYB37, MYB38 and MYB87 sequences are used to identify homologues from other species according to the methods herein, examples of which are included in Table 1

The term “MYB sequence” refers to a polynucleotide sequence or a polypeptide sequence as contextually appropriate.

Sequences

The following sequences from the MYB-subgroup14 family, and corresponding sequence identifiers, are employed throughout the specification, examples and appended claims:

TABLE 1 SEQ ID Accession Number MYB NO: SPECIES Reference Identification 1 ARABIDOPSIS THALIANA NM_125976.2 MYB68 NT 2 ARABIDOPSIS THALIANA NP_201380.1 MYB68 AA 3 ARABIDOPSIS THALIANA NM_114829.3 MYB84 NT 4 ARABIDOPSIS THALIANA NP_190538.1 MYB84 AA 5 ARABIDOPSIS THALIANA NM_125143.3 MYB36 NT 6 ARABIDOPSIS THALIANA NP_200570.1 MYB36 AA 7 BRASSICA RAPA MYB68 NT 8 BRASSICA RAPA MYB68 AA 9 ORYZA SATIVA NM_001057941.1 MYB36 NT 10 ORYZA SATIVA AAT85046.1 MYB36 AA 11 GOSSYPIUM TC34239 MYB68 NT 12 GOSSYPIUM TC34239_ORF MYB68 AA 13 GLYCINE MAX DQ822965.1 MYB84 NT 14 GLYCINE MAX ABH02906.1 MYB84 AA 15 GLYCINE MAX MYB84 NT 16 GLYCINE MAX MYB84 AA 17 ZEA MAYS TC370133 MYB84 NT 18 ZEA MAYS TC370133_ORF MYB84 AA 19 SORGHUM BICOLOR AF474127 MYB36 NT 20 SORGHUM BICOLOR AAL84760.1 MYB36 AA 21 TRITICUM AESTIVUM BQ483726 MYB84 NT 22 TRITICUM AESTIVUM BQ483726_ORF MYB84 AA 23 POPULUS TC54478 MYB84 NT 24 POPULUS TC54478_ORF MYB84 AA 25 MEDICAGO TC97441 MYB68 NT TRUNCATULA 26 MEDICAGO TC97441_ORF MYB68 AA TRUNCATULA 27 SOLANUM AF426174.1 MYB36 NT LYCOPERSICUM 28 SOLANUM AAL69334.1 MYB36 AA LYCOPERSICUM 29 SOLANUM BG134669 MYB36 NT LYCOPERSICUM 30 SOLANUM BG134669_ORF MYB36 AA LYCOPERSICUM 31 ARABIDOPSIS THALIANA NM_119940.3 MYB87 NT 32 ARABIDOPSIS THALIANA NP_195492.2 MYB87 AA 33 ARABIDOPSIS THALIANA NM_122206.3 MYB37 NT 34 ARABIDOPSIS THALIANA NP_197691.1 MYB37 AA 35 ARABIDOPSIS THALIANA NM_129245.2 MYB38 NT 36 ARABIDOPSIS THALIANA NP_181226.1 MYB38 AA 37 AEGILOPS SPELTOIDES BQ841600.1 MYB36 NT 38 ANTIRRHINUM MAJUS AJ794728.1 MYB68 NT 39 ANTIRRHINUM MAJUS AJ794728.1_ORF MYB68 AA 40 AQUILEGIA TC13008 MYB84 NT 41 AQUILEGIA TC13008_ORF MYB84 AA 42 AQUILEGIA TC11167 MYB36 NT 43 AQUILEGIA TC11167_ORF MYB36 AA 44 ARACHIS HYPOGAEA CD038321.1 MYB68 NT 45 ARACHIS HYPOGAEA CD038321.1_ORF MYB68 AA 46 ARACHIS HYPOGAEA ES761155.1 MYB68 NT 47 ARACHIS HYPOGAEA ES761155.1_ORF MYB68 AA 48 ARACHIS STENOSPERMA EH046152.1 MYB36 NT 49 ARACHIS STENOSPERMA EH046152.1_ORF MYB36 AA 50 BRACHYPODIUM DV486330.1 MYB38 NT DISTACHYON 51 BRACHYPODIUM DV486330.1_ORF MYB38 AA DISTACHYON 52 BRACHYPODIUM DV488965.1 MYB38 NT DISTACHYON 53 BRACHYPODIUM DV488965.1_ORF MYB38 AA DISTACHYON 54 BRASSICA NAPUS (bud) MYB68 NT 55 BRASSICA NAPUS (bud) MYB68 AA 56 BRASSICA NAPUS (root) MYB68 NT 57 BRASSICA NAPUS (root) MYB68 AA 58 BRASSICA NAPUS TC40384 MYB68 NT 59 BRASSICA NAPUS ES900275.1 MYB68 NT 60 BRASSICA NAPUS ES900275.1_ORF MYB68 AA 61 BRASSICA NAPUS TC55899 MYB38 NT 62 BRASSICA NAPUS TC55899_ORF MYB38 AA 63 BRASSICA RAPA EX134980.1 MYB68 NT 64 BRASSICA RAPA EX134980.1_ORF MYB68 AA 65 BRASSICA RAPA EX137439.1 MYB68 NT 66 BRASSICA RAPA EX137439.1_ORF MYB68 AA 67 CARTHAMUS EL384492.1 MYB36 NT TINCTORIUS 68 CARTHAMUS EL384492.1_ORF MYB36 AA TINCTORIUS 69 CARTHAMUS EL392277.1 MYB36 NT TINCTORIUS 70 CARTHAMUS EL392277.1_ORF MYB36 AA TINCTORIUS 71 CENTAUREA MACULOSA EH724496.1 MYB36 NT 72 CENTAUREA MACULOSA EH724496.1_ORF MYB36 AA 73 CENTAUREA MACULOSA EH719165.1 MYB36 NT 74 CENTAUREA MACULOSA EH719165.1_ORF MYB36 AA 75 CENTAUREA MACULOSA EH724438.1 MYB68 NT 76 CENTAUREA MACULOSA EH724438.1_ORF MYB68 AA 77 CENTAUREA EH774519.1 MYB68 NT SOLSTITIALIS 78 CENTAUREA EH774519.1_ORF MYB68 AA SOLSTITIALIS 79 CENTAUREA EH771972.1 MYB68 NT SOLSTITIALIS 80 CENTAUREA EH771972.1_ORF MYB68 AA SOLSTITIALIS 81 CENTAUREA EH768792.1 MYB84 NT SOLSTITIALIS 82 CENTAUREA EH768792.1_ORF MYB84 AA SOLSTITIALIS 83 CICHORIUM ENDIVIA EL361859.1 MYB84 NT 84 CICHORIUM ENDIVIA EL361859.1_ORF MYB84 AA 85 CICHORIUM INTYBUS EH681135.1 MYB38 NT 86 CICHORIUM INTYBUS EH681135.1_ORF MYB38 AA 87 CICHORIUM INTYBUS EH694860.1 MYB68 NT 88 CITRUS SINENSIS CK936024.1 MYB68 NT 89 CITRUS SINENSIS CK936024.1_ORF MYB68 AA 90 COFFEA CANEPHORA DV692261.1 MYB37 NT 91 COFFEA CANEPHORA DV691112.1 MYB37 NT 92 CUCUMIS MELO AM727197.2 MYB36 NT 93 CUCUMIS MELO AM727197.2_ORF MYB36 AA 94 CUCUMIS MELO AM716075.2 MYB36 NT 95 CUCUMIS MELO AM716075.2_ORF MYB36 AA 96 DAUCUS CAROTA AB298508.1 MYB68 NT 97 DAUCUS CAROTA BAF49444.1 MYB68 AA 98 ELAEIS GUINEENSIS EL690464.1 MYB84 NT 99 ELAEIS GUINEENSIS EL690464.1_ORF MYB84 AA 100 ELAEIS OLEIFERA ES370938.1 MYB84 NT 101 ELAEIS OLEIFERA ES370938.1_ORF MYB84 AA 102 ESCHSCHOLZIA CD480801.1 MYB68 NT CALIFORNICA 103 ESCHSCHOLZIA CD480801.1_ORF MYB68 AA CALIFORNICA 104 EUPHORBIA ESULA DV138530.1 MYB84 NT 105 EUPHORBIA ESULA DV138530.1_ORF MYB84 AA 106 EUPHORBIA ESULA DV126436.1 MYB36 NT 107 EUPHORBIA ESULA DV126436.1_ORF MYB36 AA 108 EUPHORBIA TIRUCALLI BP958179.1 MYB84 NT 109 GINKGO BILOBA EX940876.1 MYB68 NT 110 GLYCINE MAX MYB84 NT 111 GLYCINE MAX MYB84 AA 112 GLYCINE MAX TC213651 MYB84 NT 113 GLYCINE MAX TC213651_ORF MYB84 AA 114 GLYCINE MAX DQ822971.1 MYB36 NT 115 GLYCINE MAX ABH02912.1 MYB36 AA 116 GLYCINE MAX TC211227 MYB36 NT 117 GLYCINE MAX TC211227_ORF MYB36 AA 118 GOSSYPIUM TC62721 MYB68 NT 119 GOSSYPIUM TC62721_ORF MYB68 AA 120 GOSSYPIUM DW491290.1 MYB36 NT 121 GOSSYPIUM DW491290.1_ORF MYB36 AA 122 HEDYOTIS TERMINALIS CB077617.1 MYB84 NT 123 HEDYOTIS TERMINALIS CB077617.1_ORF MYB84 AA 124 HELIANTHUS ANNUUS BQ967558 MYB36 NT 125 HELIANTHUS EE621630.1 MYB36 NT ARGOPHYLLUS 126 HELIANTHUS EE621630.1_ORF MYB36 AA ARGOPHYLLUS 127 HELIANTHUS EE619500.1 MYB36 NT ARGOPHYLLUS 128 HELIANTHUS EE619500.1_ORF MYB36 AA ARGOPHYLLUS 129 HELIANTHUS CILIARIS EL422629.1 MYB68 NT 130 HELIANTHUS EXILIS EE645503.1 MYB68 NT 131 HELIANTHUS EXILIS EE645503.1_ORF MYB68 AA 132 HELIANTHUS EXILIS EE646813.1 MYB36 NT 133 HELIANTHUS EXILIS EE646813.1_ORF MYB36 AA 134 HELIANTHUS EL474327.1 MYB84 NT PARADOXUS 135 HELIANTHUS PETIOLARIS DY942970.1 MYB84 NT 136 HELIANTHUS PETIOLARIS DY942970.1_ORF MYB84 AA 137 HELIANTHUS PETIOLARIS DY953493.1 MYB68 NT 138 HELIANTHUS PETIOLARIS DY953493.1_ORF MYB68 AA 139 HELIANTHUS EL445341.1 MYB36 NT TUBEROSUS 140 HELIANTHUS EL445341.1_ORF MYB36 AA TUBEROSUS 141 HORDEUM VULGARE BY845215.1 MYB38 NT 142 HORDEUM VULGARE BY845215.1_ORF MYB38 AA 143 HUMULUS LUPULUS AJ876882.1 MYB36 NT 144 HUMULUS LUPULUS CAI46244.1 MYB36 AA 145 LACTUCA PERENNIS DW092247.1 MYB84 NT 146 LACTUCA PERENNIS DW092247.1_ORF MYB84 AA 147 LACTUCA SALIGNA DW065247.1 MYB68 NT 148 LACTUCA SALIGNA DW065247.1_ORF MYB68 AA 149 LACTUCA SATIVA DY960463.1 MYB38 NT 150 LACTUCA SATIVA DY960463.1_ORF MYB38 AA 151 LACTUCA SATIVA DY969483.1 MYB38 NT 152 LACTUCA SATIVA DY969483.1_ORF MYB38 AA 153 LACTUCA SATIVA DY980672.1 MYB38 NT 154 LACTUCA SATIVA DY980672.1_ORF MYB38 AA 155 LACTUCA SERRIOLA DW108054.1 MYB38 NT 156 LACTUCA VIROSA DW160139.1 MYB84 NT 157 LACTUCA VIROSA DW160139.1_ORF MYB84 AA 158 LACTUCA VIROSA DW160891.1 MYB38 NT 159 LACTUCA VIROSA DW160891.1_ORF MYB38 AA 160 LIRIODENDRON CO998829.1 MYB38 NT TULIPIFERA 161 LIRIODENDRON CO998829.1_ORF MYB38 AA TULIPIFERA 162 MALUS DOMESTICA DT002401.1 MYB36 NT 163 MALUS DOMESTICA DT002401.1_ORF MYB36 AA 164 MALUS DOMESTICA DQ074472.1 MYB38 NT 165 MALUS DOMESTICA AAZ20440.1 MYB38 AA 166 MANIHOT ESCULENTA DB936694.1 MYB68 NT 167 MANIHOT ESCULENTA DB936694.1_ORF MYB68 AA 168 MARCHANTIA BJ846153.1 MYB84 NT POLYMORPHA 169 MARCHANTIA BJ846153.1_ORF MYB84 AA POLYMORPHA 170 MEDICAGO TC110497 MYB36 NT TRUNCATULA 171 MEDICAGO TC110497_ORF MYB36 AA TRUNCATULA 172 MEDICAGO BF634640 MYB84 NT TRUNCATULA 173 MEDICAGO BF634640_ORF MYB84 AA TRUNCATULA 174 NUPHAR ADVENA CD472544.1 MYB36 NT 175 NUPHAR ADVENA CD472544.1_ORF MYB36 AA 176 ORYZA SATIVA LOC_Os01g09590.1_cds MYB38 NT 177 ORYZA SATIVA LOC_Os01g09590.1 MYB38 AA 178 ORYZA SATIVA LOC_Os01g49160.1_cds MYB36 NT 179 ORYZA SATIVA LOC_Os01g49160.1 MYB36 AA 180 ORYZA SATIVA LOC_Os01g52410.1_cds MYB38 NT 181 ORYZA SATIVA LOC_Os01g52410.1 MYB38 AA 182 ORYZA SATIVA LOC_Os02g54520.1_cds MYB36 NT 183 ORYZA SATIVA LOC_Os02g54520.1 MYB36 AA 184 ORYZA SATIVA LOC_Os05g48010.1_cds MYB36 NT 185 ORYZA SATIVA LOC_Os05g48010.1 MYB36 AA 186 ORYZA SATIVA LOC_Os08g15020.1_cds MYB36 NT 187 ORYZA SATIVA LOC_Os08g15020.1 MYB36 AA 188 ORYZA SATIVA LOC_Os09g26170.1_cds MYB36 NT 189 ORYZA SATIVA LOC_Os09g26170.1 MYB36 AA 190 ORYZA SATIVA LOC_Os10g35660.1_cds MYB36 NT 191 ORYZA SATIVA LOC_Os10g35660.1 MYB36 AA 192 PICEA EX361512.1 MYB68 NT 193 PICEA EX361512.1_ORF MYB68 AA 194 PICEA TC20498 MYB68 NT 195 PICEA TC20498_ORF MYB68 AA 196 PINUS DR015810 MYB84 NT 197 PINUS DR015810_ORF MYB84 AA 198 PINUS TC66643 MYB68 NT 199 PINUS TC66643_ORF MYB68 AA 200 PONCIRUS TRIFOLIATA CD575120.1 MYB38 NT 201 PONCIRUS TRIFOLIATA CD575120.1_ORF MYB38 AA 202 POPULUS Gw1.II.96.1 MYB68 NT 203 POPULUS Gw1.II.96.1_ORF MYB68 AA 204 POPULUS DB879439.1 MYB84 NT 205 POPULUS DB879439.1_ORF MYB84 AA 206 POPULUS TC74579 MYB36 NT 207 POPULUS TC74579_ORF MYB36 AA 208 QUERCUS PETRAEA CU639795.1 MYB36 NT 209 QUERCUS PETRAEA CU639795.1_ORF MYB36 AA 210 QUERCUS SUBER EE743680.1 MYB84 NT 211 RAPHANUS FD544184.1 MYB68 NT RAPHANISTRUM 212 RAPHANUS FD544184.1_ORF MYB68 AA RAPHANISTRUM 213 RAPHANUS EY915531.1 MYB68 NT RAPHANISTRUM 214 RAPHANUS FD540311.1 MYB68 NT RAPHANISTRUM 215 RAPHANUS FD540311.1_ORF MYB68 AA RAPHANISTRUM 216 RAPHANUS EV548164.1 MYB38 NT RAPHANISTRUM 217 RAPHANUS SATIVUS FD580369.1 MYB68 NT 218 ROSA HYBRID EC587279.1 MYB68 NT 219 ROSA HYBRID EC587279.1_ORF MYB68 AA 220 SACCHARUM CA150911 MYB36 NT OFFICINARUM 221 SACCHARUM CA150911_ORF MYB36 AA OFFICINARUM 222 SACCHARUM CA258665 MYB84 NT OFFICINARUM 223 SACCHARUM CA258665_ORF MYB84 AA OFFICINARUM 224 SACCHARUM TC44677 MYB36 NT OFFICINARUM 225 SACCHARUM TC44677_ORF MYB36 AA OFFICINARUM 226 SECALE CEREALE BE495537 MYB38 NT 227 SECALE CEREALE BE495537_ORF MYB38 AA 228 SOLANUM TC182203 MYB36 NT LYCOPERSICUM 229 SOLANUM TC182203_ORF MYB36 AA LYCOPERSICUM 230 SOLANUM TUBEROSUM AM907873.1 MYB36 NT 231 SOLANUM TUBEROSUM AM907873.1_ORF MYB36 AA 232 SORGHUM BICOLOR TC98185 MYB36 NT 233 SORGHUM BICOLOR TC98185_ORF MYB36 AA 234 SORGHUM BICOLOR TC101637 MYB36 NT 235 SORGHUM BICOLOR TC101637_ORF MYB36 AA 236 SORGHUM PROPINQUUM BG560270.1 MYB36 NT 237 SORGHUM PROPINQUUM BG560270.1_ORF MYB36 AA 238 TARAXACUM DY830100.1 MYB68 NT OFFICINALE 239 TARAXACUM DY830100.1_ORF MYB68 AA OFFICINALE 240 TRIPHYSARIA PUSILLA EY172046.1 MYB68 NT 241 TRIPHYSARIA PUSILLA EY172046.1_ORF MYB68 AA 242 TRIPHYSARIA PUSILLA EY179359.1 MYB36 NT 243 TRIPHYSARIA PUSILLA EY179359.1_ORF MYB36 AA 244 TRIPHYSARIA PUSILLA EY174724.1 MYB38 NT 245 TRIPHYSARIA PUSILLA EY174724.1_ORF MYB38 AA 246 TRIPHYSARIA EX989121.1 MYB38 NT VERSICOLOR 247 TRIPHYSARIA EY018825.1 MYB38 NT VERSICOLOR 248 TRIPHYSARIA EY018825.1_ORF MYB38 AA VERSICOLOR 249 TRITICUM AESTIVUM MYB84 NT 250 TRITICUM AESTIVUM MYB84 AA 251 VACCINIUM CV090776.1 MYB36 NT CORYMBOSUM 252 VACCINIUM CV090776.1_ORF MYB36 AA CORYMBOSUM 253 VITIS VINIFERA CAO70108.1_cds MYB84 NT 254 VITIS VINIFERA CAO70108.1 MYB84 AA 255 VITIS VINIFERA CAO43296.1_cds MYB36 NT 256 VITIS VINIFERA CAO43296.1 MYB36 AA 257 VITIS VINIFERA CAO61524.1_cds MYB84 NT 258 VITIS VINIFERA CAO61524.1 MYB84 AA 259 VITIS VINIFERA DT006424 MYB36 NT 260 VITIS VINIFERA DT006424_ORF MYB36 AA 261 ZEA MAYS MYB36 NT 262 ZEA MAYS MYB36 AA 263 ZEA MAYS TC320820 MYB36 NT 264 ZEA MAYS TC320820_ORF MYB36 AA 265 ORYZA SATIVA MYB36 NT Determining Homology Between Two or More Sequences

To determine the percent homology of two amino acid sequences or of two nucleic acids, the sequences are aligned for optimal comparison purposes (e.g., gaps can be introduced in either of the sequences being compared for optimal alignment between the sequences). The amino acid residues or nucleotides at corresponding amino acid positions or nucleotide positions are then compared. When a position in the first sequence is occupied by the same amino acid residue or nucleotide as the corresponding position in the second sequence, then the molecules are homologous at that position (i.e., as used herein amino acid or nucleic acid “homology” is equivalent to amino acid or nucleic acid “identity”).

The nucleic acid sequence homology may be determined as the degree of identity between two sequences. The homology may be determined using computer programs known in the art, such as GAP software provided in the GCG program package. See, Needleman and Wunsch 1970 J Mol Biol 48: 443-453. Using GCG GAP software with the following settings for nucleic acid sequence comparison: GAP creation penalty of 5.0 and GAP extension penalty of 0.3, the coding region of the analogous nucleic acid sequences referred to above exhibits a degree of identity preferably of at least 70%, 75%, 80%, 85%, 90%, 95%, 98%, or 99%, with the coding sequence (encoding) part of the DNA sequence shown in Table 1.

The term “sequence identity” refers to the degree to which two polynucleotide or polypeptide sequences are identical on a residue-by-residue basis over a particular region of comparison. The term “percentage of sequence identity” is calculated by comparing two optimally aligned sequences over that region of comparison, determining the number of positions at which the identical nucleic acid base (e.g., A, T, C, G, U, or I, in the case of nucleic acids) occurs in both sequences to yield the number of matched positions, dividing the number of matched positions by the total number of positions in the region of comparison (i.e., the window size), and multiplying the result by 100 to yield the percentage of sequence identity. The term “substantial identity” as used herein denotes a characteristic of a polynucleotide sequence, wherein the polynucleotide comprises a sequence that has at least 80 percent sequence identity, preferably at least 85 percent identity and often 90 to 95 percent sequence identity, more usually at least 99 percent sequence identity as compared to a reference sequence over a comparison region. The term “percentage of positive residues” is calculated by comparing two optimally aligned sequences over that region of comparison, determining the number of positions at which the identical and conservative amino acid substitutions, as defined above, occur in both sequences to yield the number of matched positions, dividing the number of matched positions by the total number of positions in the region of comparison (i.e., the window size), and multiplying the result by 100 to yield the percentage of positive residues.

Recombinant Expression Vectors and Host Cells

Another aspect of the invention pertains to vectors, preferably expression vectors, containing a nucleic acid encoding a MYB-subgroup14 protein, gene, analogs or homologs thereof. The sequence encoding a MYB-subgroup14 polypeptide may be a genomic sequence or a cDNA sequence. As used herein the term expression vector includes vectors which are designed to provide transcription of the nucleic acid sequence. The transcribed nucleic acid may be translated into a polypeptide or protein product. As used herein, the term “vector” refers to a nucleic acid molecule capable of transporting another nucleic acid to which it has been linked. One type of vector is a “plasmid”, which refers to a circular double stranded DNA loop into which additional DNA segments can be ligated. Another type of vector is a viral vector, wherein additional DNA segments can be ligated into the viral genome. Certain vectors are capable of autonomous replication in a host cell into which they are introduced (e.g., bacterial vectors having a bacterial origin of replication). Other vectors are integrated into the genome of a host cell upon introduction into the host cell, and thereby are replicated along with the host genome. Moreover, certain vectors are capable of directing the expression of genes to which they are operatively-linked. Such vectors are referred to herein as “expression vectors”. In general, expression vectors of utility in recombinant DNA techniques are often in the form of plasmids. In the present specification, “plasmid” and “vector” can be used interchangeably as the plasmid is the most commonly used form of vector. However, the invention is intended to include such other forms of expression vectors, such as viral vectors or plant transformation vectors, binary or otherwise, which serve equivalent functions.

The recombinant expression vectors of the invention comprise a nucleic acid of the invention in a form suitable for expression of the nucleic acid in a host cell, which means that the recombinant expression vectors include one or more regulatory sequences, selected on the basis of the host cells to be used for expression, that is operatively-linked to the nucleic acid sequence to be expressed. Within a recombinant expression vector, “operably-linked” is intended to mean that the nucleotide sequence of interest is linked to the regulatory sequence(s) in a manner that allows for expression of the nucleotide sequence (e.g., in an in vitro transcription/translation system or in a host cell when the vector is introduced into the host cell).

The term “regulatory sequence” is intended to include promoters, enhancers and other expression control elements (e.g., polyadenylation signals). Such regulatory sequences are described, for example, in Goeddel, GENE EXPRESSION TECHNOLOGY: METHODS IN ENZYMOLOGY 185, Academic Press, San Diego, Calif. (1990). Regulatory sequences include those that direct constitutive expression of a nucleotide sequence in many types of host cell and those that direct expression of the nucleotide sequence only in certain host cells (e.g., tissue-specific regulatory sequences) or inducible promoters (e.g., induced in response to abiotic factors such as environmental conditions, heat, drought, nutrient status or physiological status of the cell or biotic such as pathogen responsive). Examples of suitable promoters include for example constitutive promoters, ABA inducible promoters, tissue specific promoters and abiotic or biotic inducible promoters. It will be appreciated by those skilled in the art that the design of the expression vector can depend on such factors as the choice of the host cell to be transformed, the level of expression of protein desired as well as timing and location of expression, etc. The expression vectors of the invention can be introduced into host cells to thereby produce proteins or peptides, including fusion proteins or peptides, encoded by nucleic acids as described herein (e.g., MYB-subgroup14 proteins such as MYB68 proteins, mutant forms of MYB68 proteins, fusion proteins, etc.).

The recombinant expression vectors of the invention can be designed for expression of a MYB-subgroup14 gene or a MYB-subgroup14 protein in prokaryotic or eukaryotic cells. For example, a MYB-subgroup14 gene or a MYB-subgroup14 protein can be expressed in bacterial cells such as Escherichia coli, insect cells (using baculovirus expression vectors) yeast cells, plant cells or mammalian cells. Suitable host cells are discussed further in Goeddel, GENE EXPRESSION TECHNOLOGY: METHODS IN ENZYMOLOGY 185, Academic Press, San Diego, Calif. (1990). Alternatively, the recombinant expression vector can be transcribed and translated in vitro, for example using T7 promoter regulatory sequences and T7 polymerase.

In one embodiment, a nucleic acid of the invention is expressed in plants cells using a plant expression vector. Examples of plant expression vectors systems include tumor inducing (Ti) plasmid or portion thereof found in Agrobacterium, cauliflower mosaic virus (CAMV) DNA and vectors such as pBI121, a pCAMBA series vector or one of preferred choice to a person skilled in the art.

For expression in plants, the recombinant expression cassette will contain in addition to a MYB-subgroup14 polynucleotide, a promoter region functional in a plant cell, a transcription initiation site (if the coding sequence to transcribed lacks one), and a transcription termination/polyadenylation sequence. The termination/polyadenylation region may be obtained from the same gene as the promoter sequence or may be obtained from different genes. Unique restriction enzyme sites at the 5′ and 3′ ends of the cassette are typically included to allow for easy insertion into a pre-existing vector.

Examples of suitable plant expressible promoters include promoters from plant viruses such as the 35S promoter from cauliflower mosaic virus (CaMV) (Odell, et al., Nature, 313: 810-812 (1985)), promoters from genes such as rice actin (McElroy, et al., Plant Cell, 163-171 (1990)), ubiquitin (Christensen, et al., Plant Mol. Biol., 12: 619-632 (1992); and Christensen, et al., Plant Mol. Biol., 18: 675-689 (1992)), pEMU (Last, et al., Theor. Appl. Genet., 81: 581-588 (1991)), MAS (Velten, et al., EMBO J., 3: 2723-2730 (1984)), maize 113 histone (Lepetit, et al., Mol. Gen. Genet., 231: 276-285 (1992); and Atanassvoa, et al., Plant Journal, 2(3): 291-300 (1992)), the 5′- or 3′-promoter derived from T-DNA of Agrobacterium tumefaciens, the Smas promoter, the cinnamyl alcohol dehydrogenase promoter (U.S. Pat. No. 5,683,439), the Nos promoter, the rubisco promoter, the GRP1-8 promoter, ALS promoter, (WO 96/30530), a synthetic promoter, such as Rsyn7, SCP and UCP promoters, ribulose-1,3-diphosphate carboxylase, fruit-specific promoters, heat shock promoters, seed-specific promoters and other transcription initiation regions from various plant genes, for example, including the various opine initiation regions, such as for example, octopine, mannopine, and nopaline. Useful promoters also include heat inducible promoters such as the HSP18.2 or HSP81.1 promoters (Takahashi et al. 1992, Plant J. 2, 751-761; Yoshida et al., 1995, Appl. Microbiol. Biotechnol. 44, 466-472; Ueda et al., 1996, Mol Gen Genet. 250, 533-539). Cryptic promoters are also useful for chimeric constructs useful in the invention. Cryptic gene regulatory elements are inactive at their native locations in the genome but are fully functional when positioned adjacent to genes in transgenic plants.

In addition to chimeric promoter-gene constructs the use of a native MYB-subgroup14 promoter is contemplated. Expression characteristics of a native promoter may be modified by inclusion of regulatory elements such that expression levels are elevated and or expressed ectopically and or constitutively. For example, a 4×35S enhancer sequence (Wiegel et al., 2000) may be included in a construct to enhance expression. Alternatively a population of plants may be produced by transformation with a construct having a 4×35S enhancer sequence, such as, a pSKI15 vector as per Wiegel et al., 2000. The transformed population can be screened for plants having increased expression of a MYB-subgroup14 sequence, or screened for plants having increased heat tolerance and reduced flower abortion, or a combination of such screens to identify a plant of interest.

Additional regulatory elements that may be connected to a MYB-subgroup14 encoding nucleic acid sequence for expression in plant cells include terminators, polyadenylation sequences, and nucleic acid sequences encoding signal peptides that permit localization within a plant cell or secretion of the protein from the cell. Such regulatory elements and methods for adding or exchanging these elements with the regulatory elements of a MYB-subgroup14 gene are known, and include, but are not limited to, 3′ termination and/or polyadenylation regions such as those of the Agrobacterium tumefaciens nopaline synthase (nos) gene (Bevan, et al., Nucl. Acids Res., 12: 369-385 (1983)); the potato proteinase inhibitor II (PINII) gene (Keil, et al., Nucl. Acids Res., 14: 5641-5650 (1986) and hereby incorporated by reference); and An, et al., Plant Cell, 1: 115-122 (1989)); and the CaMV 19S gene (Mogen, et al., Plant Cell, 2: 1261-1272 (1990)).

Plant signal sequences, including, but not limited to, signal-peptide encoding DNA/RNA sequences which target proteins to the extracellular matrix of the plant cell (Dratewka-Kos, et al., J. Biol. Chem., 264: 4896-4900 (1989)) and the Nicotiana plumbaginifolia extension gene (DeLoose, et al., Gene, 99: 95-100 (1991)), or signal peptides which target proteins to the vacuole like the sweet potato sporamin gene (Matsuka, et al., Proc. Nat'l Acad. Sci. (USA), 88: 834 (1991)) and the barley lectin gene (Wilkins, et al., Plant Cell, 2: 301-313 (1990)), or signals which cause proteins to be secreted such as that of PIM (Lind, et al., Plant Mol. Biol., 18: 47-53 (1992)), or those which target proteins to the plastids such as that of rapeseed enoyl-ACP reductase (Verwaert, et al., Plant Mol. Biol., 26: 189-202 (1994)) are useful in the invention.

In another embodiment, the recombinant expression vector is capable of directing expression of the nucleic acid preferentially in a particular cell type (e.g., tissue-specific regulatory elements are used to express the nucleic acid). Tissue-specific regulatory elements are known in the art. Especially useful in connection with the nucleic acids of the present invention are expression systems which are operable in plants. These include systems which are under control of a tissue-specific promoter, as well as those which involve promoters that are operable in all plant tissues.

Organ-specific promoters are also well known. For example, the chalcone synthase-A gene (van der Meer et al., 1990, Plant Molecular Biology 15(1):95-109) or the dihydroflavonol-4-reductase (dfr) promoter (Elomaa et al., The Plant Journal, 16(1) 93-99) direct expression in specific floral tissues. Also available are the patatin class I promoter is transcriptionally activated only in the potato tuber and can be used to target gene expression in the tuber (Bevan, M., 1986, Nucleic Acids Research 14:4625-4636). Another potato-specific promoter is the granule-bound starch synthase (GBSS) promoter (Visser, R. G. R, et al., 1991, Plant Molecular Biology 17:691-699).

Other organ-specific promoters appropriate for a desired target organ can be isolated using known procedures. These control sequences are generally associated with genes uniquely expressed in the desired organ. In a typical higher plant, each organ has thousands of mRNAs that are absent from other organ systems (reviewed in Goldberg, P., 1986, Trans. R. Soc. London B314:343).

The resulting expression system or cassette is ligated into or otherwise constructed to be included in a recombinant vector which is appropriate for plant transformation. The vector may also contain a selectable marker gene by which transformed plant cells can be identified in culture. The marker gene may encode antibiotic resistance. These markers include resistance to G418, hygromycin, bleomycin, kanamycin, and gentamicin. Alternatively the marker gene may encode a herbicide tolerance gene that provides tolerance to glufosinate or glyphosate type herbicides. After transforming the plant cells, those cells having the vector will be identified by their ability to grow on a medium containing the particular antibiotic or herbicide. Replication sequences, of bacterial or viral origin, are generally also included to allow the vector to be cloned in a bacterial or phage host, preferably a broad host range prokaryotic origin of replication is included. A selectable marker for bacteria should also be included to allow selection of bacterial cells bearing the desired construct. Suitable prokaryotic selectable markers also include resistance to antibiotics such as kanamycin or tetracycline.

Other DNA sequences encoding additional functions may also be present in the vector, as is known in the art. For instance, in the case of Agrobacterium transformations, T-DNA sequences will also be included for subsequent transfer to plant chromosomes.

Another aspect of the invention pertains to host cells into which a recombinant expression vector of the invention has been introduced. The terms “host cell” and “recombinant host cell” are used interchangeably herein. It is understood that such terms refer not only to the particular subject cell but also to the progeny or potential progeny of such a cell. Because certain modifications may occur in succeeding generations due to either mutation or environmental influences, such progeny may not, in fact, be identical to the parent cell, but are still included within the scope of the term as used herein. Vector DNA can be introduced into prokaryotic or eukaryotic cells via conventional transformation or transfection techniques. As used herein, the terms “transformation” and “transfection” are intended to refer to a variety of art-recognized techniques for introducing foreign nucleic acid (e.g., DNA) into a host cell.

A host cell of the invention, such as a prokaryotic or eukaryotic host cell in culture, can be used to produce (i.e., express) a polypeptide of the invention encoded in an open reading frame of a polynucleotide of the invention. Accordingly, the invention further provides methods for producing a polypeptide using the host cells of the invention. In one embodiment, the method comprises culturing the host cell of invention (into which a recombinant expression vector encoding a polypeptide of the invention has been introduced) in a suitable medium such that the polypeptide is produced. In another embodiment, the method further comprises isolating the polypeptide from the medium or the host cell.

A number of cell types may act as suitable host cell for expression of a polypeptide encoded by an open reading frame in a polynucleotide of the invention. Plant host cells include, for example, plant cells that could function as suitable hosts for the expression of a polynucleotide of the invention include epidermal cells, mesophyll and other ground tissues, and vascular tissues in leaves, stems, floral organs, and roots from a variety of plant species, for example Arabidopsis, Brassica, Oryza, Zea, Sorghum, Gossypium, Triticum, Glycine, Pisum, Phaseolus, Lycopersicon, Trifolium, Cannabis, Cucurbita, Rosa, Vitis, Juglans, Fragaria, Lotus, Medicago, Onobrychis, Trigonella, Vigna, Citrus, Linum, Geranium, Manihot, Daucus, Raphanus, Sinapis, Atropa, Capsicum, Datura, Hyoscyamus, Nicotiana, Solanum, Petunia, Digitalis, Majorana, Ciahorium, Helianthus, Lactuca, Bromus, Asparagus, Antirrhinum, Heterocallis, Nemesis, Pelargonium, Panieum, Pennisetum, Ranunculus, Senecio, Salpiglossis, Cucumis, Browaalia, Lolium, Avena, Hordeum, Secale, Picea, Caco, and Populus.

Conservative Mutations

In addition to naturally-occurring allelic variants of a MYB-subgroup14 or a MYB68 sequence that may exist in the population, the skilled artisan will further appreciate that changes can be introduced by mutation into the nucleotide sequence of SEQ ID NO:1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 88, 90, 91, 92, 94, 96, 98, 100, 102, 104, 106, 108, 109, 110, 112, 114, 116, 118, 120, 122, 124, 125, 127, 129, 130, 132, 134, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 200, 202, 204, 206, 208, 210, 211, 213, 214, 216, 217, 218, 220, 222, 224, 226, 228, 230, 232, 234, 236, 238, 240, 242, 244, 246, 247, 249, 251, 253, 255, 257, 259, 261, 263 and 265 thereby leading to changes in the amino acid sequence of the encoded MYB-subgroup14 or a MYB68 protein, without altering the functional ability of the MYB-subgroup14 or a MYB68 protein. For example, nucleotide substitutions leading to amino acid substitutions at “non-essential” amino acid residues can be made in the sequence of SEQ ID NO:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 89, 93, 95, 97, 99, 101, 103, 105, 107, 111, 113, 115, 117, 119, 121, 123, 126, 128, 131, 133, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 157, 159, 161, 163, 165, 167, 169, 171, 173, 175, 177, 179, 181, 183, 185, 187, 189, 191, 193, 195, 197, 199, 201, 203, 205, 207, 209, 212, 215, 219, 221, 223, 225, 227, 229, 231, 233, 235, 237, 239, 241, 243, 245, 248, 250, 252, 254, 256, 258, 260, 262 and 264. A “non-essential” amino acid residue is a residue that can be altered from the wild-type sequence of a MYB-subgroup14 or a MYB68 without altering the biological activity, whereas an “essential” amino acid residue is required for biological activity. For example, amino acid residues that are conserved among MYB-subgroup14 or MYB68 proteins of the present invention are predicted to be poor candidates for alteration. Alignments and identification of conserved regions are described herein and provide further guidance as to identification of essential amino acids and conserved amino acids.

Another aspect of the invention pertains to nucleic acid molecules encoding a MYB-subgroup14 or MYB68 protein that contain changes in amino acid residues that are not essential for activity. Such MYB-subgroup14 or MYB68 proteins differ in amino acid sequence from SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 89, 93, 95, 97, 99, 101, 103, 105, 107, 111, 113, 115, 117, 119, 121, 123, 126, 128, 131, 133, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 157, 159, 161, 163, 165, 167, 169, 171, 173, 175, 177, 179, 181, 183, 185, 187, 189, 191, 193, 195, 197, 199, 201, 203, 205, 207, 209, 212, 215, 219, 221, 223, 225, 227, 229, 231, 233, 235, 237, 239, 241, 243, 245, 248, 250, 252, 254, 256, 258, 260, 262 and 264 yet retain biological activity. In one embodiment, the isolated nucleic acid molecule comprises a nucleotide sequence encoding a protein, wherein the protein comprises an amino acid sequence at least about 75% homologous to the amino acid sequence of SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 89, 93, 95, 97, 99, 101, 103, 105, 107, 111, 113, 115, 117, 119, 121, 123, 126, 128, 131, 133, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 157, 159, 161, 163, 165, 167, 169, 171, 173, 175, 177, 179, 181, 183, 185, 187, 189, 191, 193, 195, 197, 199, 201, 203, 205, 207, 209, 212, 215, 219, 221, 223, 225, 227, 229, 231, 233, 235, 237, 239, 241, 243, 245, 248, 250, 252, 254, 256, 258, 260, 262 and 264. Preferably, the protein encoded by the nucleic acid is at least about 80% homologous to SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 89, 93, 95, 97, 99, 101, 103, 105, 107, 111, 113, 115, 117, 119, 121, 123, 126, 128, 131, 133, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 157, 159, 161, 163, 165, 167, 169, 171, 173, 175, 177, 179, 181, 183, 185, 187, 189, 191, 193, 195, 197, 199, 201, 203, 205, 207, 209, 212, 215, 219, 221, 223, 225, 227, 229, 231, 233, 235, 237, 239, 241, 243, 245, 248, 250, 252, 254, 256, 258, 260, 262 and 264 more preferably at least about 90%, 95%, 98%, and most preferably at least about 99% homologous to SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 89, 93, 95, 97, 99, 101, 103, 105, 107, 111, 113, 115, 117, 119, 121, 123, 126, 128, 131, 133, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 157, 159, 161, 163, 165, 167, 169, 171, 173, 175, 177, 179, 181, 183, 185, 187, 189, 191, 193, 195, 197, 199, 201, 203, 205, 207, 209, 212, 215, 219, 221, 223, 225, 227, 229, 231, 233, 235, 237, 239, 241, 243, 245, 248, 250, 252, 254, 256, 258, 260, 262 and 264.

An isolated nucleic acid molecule encoding a MYB-subgroup14 or a MYB68 protein homologous to the protein of SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 89, 93, 95, 97, 99, 101, 103, 105, 107, 111, 113, 115, 117, 119, 121, 123, 126, 128, 131, 133, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 157, 159, 161, 163, 165, 167, 169, 171, 173, 175, 177, 179, 181, 183, 185, 187, 189, 191, 193, 195, 197, 199, 201, 203, 205, 207, 209, 212, 215, 219, 221, 223, 225, 227, 229, 231, 233, 235, 237, 239, 241, 243, 245, 248, 250, 252, 254, 256, 258, 260, 262 and 264 can be created by introducing one or more nucleotide substitutions, additions or deletions into the nucleotide sequence of SEQ ID NO:1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 88, 90, 91, 92, 94, 96, 98, 100, 102, 104, 106, 108, 109, 110, 112, 114, 116, 118, 120, 122, 124, 125, 127, 129, 130, 132, 134, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 200, 202, 204, 206, 208, 210, 211, 213, 214, 216, 217, 218, 220, 222, 224, 226, 228, 230, 232, 234, 236, 238, 240, 242, 244, 246, 247, 249, 251, 253, 255, 257, 259, 261, 263 and 265 such that one or more amino acid substitutions, additions or deletions are introduced into the encoded protein.

Mutations can be introduced into the nucleotide sequence of SEQ ID NO:1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 88, 90, 91, 92, 94, 96, 98, 100, 102, 104, 106, 108, 109, 110, 112, 114, 116, 118, 120, 122, 124, 125, 127, 129, 130, 132, 134, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 200, 202, 204, 206, 208, 210, 211, 213, 214, 216, 217, 218, 220, 222, 224, 226, 228, 230, 232, 234, 236, 238, 240, 242, 244, 246, 247, 249, 251, 253, 255, 257, 259, 261, 263 and 265 by standard techniques, such as site-directed mutagenesis and PCR-mediated mutagenesis. Preferably, conservative amino acid substitutions are made at one or more predicted non-essential amino acid residues. A “conservative amino acid substitution” is one in which the amino acid residue is replaced with an amino acid residue having a similar side chain. Families of amino acid residues having similar side chains have been defined in the art. These families include amino acids with basic side chains (e.g., lysine, arginine, histidine), acidic side chains (e.g., aspartic acid, glutamic acid), uncharged polar side chains (e.g., glycine, asparagine, glutamine, serine, threonine, tyrosine, cysteine), nonpolar side chains (e.g., alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, tryptophan), beta-branched side chains (e.g., threonine, valine, isoleucine) and aromatic side chains (e.g., tyrosine, phenylalanine, tryptophan, histidine). Thus, a predicted nonessential amino acid residue in MYB68 is replaced with another amino acid residue from the same side chain family. Alternatively, in another embodiment, mutations can be introduced randomly along all or part of a MYB-subgroup14 or a MYB68 coding sequence, such as by saturation mutagenesis, and the resultant mutants can be screened for biological activity to identify mutants that retain activity and the desired phenotypes. Following mutagenesis of SEQ ID NO:1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 88, 90, 91, 92, 94, 96, 98, 100, 102, 104, 106, 108, 109, 110, 112, 114, 116, 118, 120, 122, 124, 125, 127, 129, 130, 132, 134, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 200, 202, 204, 206, 208, 210, 211, 213, 214, 216, 217, 218, 220, 222, 224, 226, 228, 230, 232, 234, 236, 238, 240, 242, 244, 246, 247, 249, 251, 253, 255, 257, 259, 261, 263 and 265 the encoded protein can be expressed by any recombinant technology known in the art and the activity of the protein can be determined.

Transformed Plants Cells and Transgenic Plants

The invention includes a protoplast, plants cell, plant tissue and plant (e.g., monocot or dicot) transformed with a MYB-subgroup14 nucleic acid, a vector containing a MYB-subgroup14 nucleic acid or an expression vector containing a MYB-subgroup14 nucleic acid. As used herein, “plant” is meant to include not only a whole plant but also a portion thereof (i.e., cells, and tissues, including for example, leaves, stems, shoots, roots, flowers, fruits and seeds).

The plant can be any plant type including, for example, species from the genera Arabidopsis, Brassica, Oryza, Zea, Sorghum, Gossypium, Triticum, Glycine, Pisum, Phaseolus, Lycopersicon, Trifolium, Cannabis, Cucurbita, Rosa, Vitis, Juglans, Fragaria, Lotus, Medicago, Onobrychis, Trigonella, Vigna, Citrus, Linum, Geranium, Manihot, Daucus, Raphanus, Sinapis, Atropa, Capsicum, Datura, Hyoscyamus, Nicotiana, Solanum, Petunia, Digitalis, Majorana, Ciahorium, Helianthus, Lactuca, Bromus, Asparagus, Antirrhinum, Heterocallis, Nemesis, Pelargonium, Panieum, Pennisetum, Ranunculus, Senecio, Salpiglossis, Cucumis, Browaalia, Lolium, Avena, Hordeum, Secale, Picea, Caco, and Populus.

The invention also includes cells, tissues, including for example, leaves, stems, shoots, roots, flowers, fruits and seeds and the progeny derived from the transformed plant.

Numerous methods for introducing foreign genes into plants are known and can be used to insert a gene into a plant host, including biological and physical plant transformation protocols (See, for example, Miki et al., (1993) “Procedure for Introducing Foreign DNA into Plants”, In: Methods in Plant Molecular Biology and Biotechnology, Glick and Thompson, eds., CRC Press, Inc., Boca Raton, pages 67-88; and Andrew Bent in, Clough S J and Bent A F, (1998) “Floral dipping: a simplified method for Agrobacterium-mediated transformation of Arabidopsis thaliana”). The methods chosen vary with the host plant, and include chemical transfection methods such as calcium phosphate, polyethylene glycol (PEG) transformation, microorganism-mediated gene transfer such as Agrobacterium (Horsch, et al., Science, 227: 1229-31 (1985)), electroporation, protoplast transformation, micro-injection, flower dipping and biolistic bombardment.

Agrobacterium-Mediated Transformation

The most widely utilized method for introducing an expression vector into plants is based on the natural transformation system of Agrobacterium tumefaciens and A. rhizogenes which are plant pathogenic bacteria which genetically transform plant cells. The Ti and Ri plasmids of A. tumefaciens and A. rhizogenes, respectfully, carry genes responsible for genetic transformation of plants (See, for example, Kado, Crit. Rev. Plant Sci., 10: 1-32 (1991)). Descriptions of the Agrobacterium vector systems and methods for Agrobacterium-mediated gene transfer are provided in Gruber et al., supra; and Moloney, et al, Plant Cell Reports, 8: 238-242 (1989).

Transgenic Arabidopsis plants can be produced easily by the method of dipping flowering plants into an Agrobacterium culture, based on the method of Andrew Bent in, Clough S J and Bent A F, 1998. Floral dipping: a simplified method for Agrobacterium-mediated transformation of Arabidopsis thaliana. Wild type plants are grown until the plant has both developing flowers and open flowers. The plants are inverted for 1 minute into a solution of Agrobacterium culture carrying the appropriate gene construct. Plants are then left horizontal in a tray and kept covered for two days to maintain humidity and then righted and bagged to continue growth and seed development. Mature seed is bulk harvested.

Direct Gene Transfer

A generally applicable method of plant transformation is microprojectile-mediated transformation, where DNA is carried on the surface of microprojectiles measuring about 1 to 4 μm. The expression vector is introduced into plant tissues with a biolistic device that accelerates the microprojectiles to speeds of 300 to 600 m/s which is sufficient to penetrate the plant cell walls and membranes. (Sanford, et al., Part. Sci. Technol., 5: 27-37 (1987); Sanford, Trends Biotech, 6: 299-302 (1988); Sanford, Physiol. Plant, 79: 206-209 (1990); Klein, et al., Biotechnology, 10: 286-291 (1992)).

Plant transformation can also be achieved by the Aerosol Beam Injector (ABI) method described in U.S. Pat. Nos. 5,240,842, 6,809,232. Aerosol beam technology is used to accelerate wet or dry particles to speeds enabling the particles to penetrate living cells Aerosol beam technology employs the jet expansion of an inert gas as it passes from a region of higher gas pressure to a region of lower gas pressure through a small orifice. The expanding gas accelerates aerosol droplets, containing nucleic acid molecules to be introduced into a cell or tissue. The accelerated particles are positioned to impact a preferred target, for example a plant cell. The particles are constructed as droplets of a sufficiently small size so that the cell survives the penetration. The transformed cell or tissue is grown to produce a plant by standard techniques known to those in the applicable art.

Regeneration of Transformants

The development or regeneration of plants from either single plant protoplasts or various explants is well known in the art (Weissbach and Weissbach, 1988). This regeneration and growth process typically includes the steps of selection of transformed cells, culturing those individualized cells through the usual stages of embryonic development through the rooted plantlet stage. Transgenic embryos and seeds are similarly regenerated. The resulting transgenic rooted shoots are thereafter planted in an appropriate plant growth medium such as soil.

The development or regeneration of plants containing the foreign, exogenous gene that encodes a polypeptide of interest introduced by Agrobacterium from leaf explants can be achieved by methods well known in the art such as described (Horsch et al., 1985). In this procedure, transformants are cultured in the presence of a selection agent and in a medium that induces the regeneration of shoots in the plant strain being transformed as described (Fraley et al., 1983). In particular, U.S. Pat. No. 5,349,124 (specification incorporated herein by reference) details the creation of genetically transformed lettuce cells and plants resulting therefrom which express hybrid crystal proteins conferring insecticidal activity against Lepidopteran larvae to such plants.

This procedure typically produces shoots within two to four months and those shoots are then transferred to an appropriate root-inducing medium containing the selective agent and an antibiotic to prevent bacterial growth. Shoots that rooted in the presence of the selective agent to form plantlets are then transplanted to soil or other media to allow the production of roots. These procedures vary depending upon the particular plant strain employed, such variations being well known in the art.

Preferably, the regenerated plants are self-pollinated to provide homozygous transgenic plants, or pollen obtained from the regenerated plants is crossed to seed-grown plants of agronomically important, preferably inbred lines. Conversely, pollen from plants of those important lines is used to pollinate regenerated plants. A transgenic plant of the present invention containing a desired polypeptide is cultivated using methods well known to one skilled in the art.

A preferred transgenic plant is an independent segregant and can transmit the MYB68 gene and its activity to its progeny. A more preferred transgenic plant is homozygous for the gene, and transmits that gene to all offspring on sexual mating. Seed from a transgenic plant may be grown in the field or greenhouse, and resulting sexually mature transgenic plants are self-pollinated to generate true breeding plants. The progeny from these plants become true breeding lines that are evaluated for increased expression of the MYB68 transgene.

Method of Producing Transgenic Plants

Included in the invention are methods of producing a transgenic plant. The method includes introducing into one or more plant cells a compound that alters expression or activity of a MYB-subgroup14 in the plant to generate a transgenic plant cell and regenerating a transgenic plant from the transgenic cell. The compound increases MYB-subgroup14 expression or activity. The increased expression and or activity can additionally be directed to occur ectopically or constitutively or in a tissue specific manner. The compound can be, e.g., (i) a MYB-subgroup14 polypeptide; (ii) a MYB-subgroup14 nucleic acid and analogs and homologs thereof; (iii) a nucleic acid that increases expression of a MYB-subgroup14 nucleic acid. A nucleic acid that increases expression of a MYB-subgroup14 nucleic acid may include promoters or enhancer elements. The promoter is a heterologous promoter or a homologous promoter. Additionally, the promoter is a constitutive or an inducible promoter. Promoters include for example, organ specific promoter or tissue specific promoter. Promoter suitable for directing gene expression are know in the art and are described herein. Enhancer elements are known to those skilled in the art. For example the enhancer element is a 35S enhancer element.

By increasing the expression of a MYB subgroup-14 polypeptide is meant that the amount produced by the cell transformed with the nucleic acid construct is greater than a cell, e.g. control cell that is not transformed with the nucleic acid construct. A control cell includes for example a cell that endogenously expresses a MYB subgroup-14 polypeptide such as a plant root cell, alternatively a control cell is a non transformed cell of the same cell-type as the transformed cell, be it a leaf cell a meristem cell or a flower or seed cell. An increase is a 1-fold, 2-fold, 3 fold or greater increase. An increase of expression is also meant to include expression of a MYB subgroup-14 polypeptide in a cell that does not typically express a MYB subgroup-14 polypeptide.

The nucleic acid can be either endogenous or exogenous. Preferably, the compound is a MYB-subgroup14 polypeptide or a MYB-subgroup14 nucleic acid encoding a MYB-subgroup14 polypeptide. For example the compound comprises the nucleic acid sequence of SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 88, 90, 91, 92, 94, 96, 98, 100, 102, 104, 106, 108, 109, 110, 112, 114, 116, 118, 120, 122, 124, 125, 127, 129, 130, 132, 134, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 200, 202, 204, 206, 208, 210, 211, 213, 214, 216, 217, 218, 220, 222, 224, 226, 228, 230, 232, 234, 236, 238, 240, 242, 244, 246, 247, 249, 251, 253, 255, 257, 259, 261, 263 and 265. Preferably, the compound is a MYB-subgroup14 nucleic acid sequence from an endogenous source to the species being transformed. Alternatively, the compound is a MYB-subgroup14 nucleic acid sequence from an exogenous source to the species being transformed.

Also included in the invention are methods of producing a transgenic plant. The method includes introducing into one or more plant cells a compound that alters a MYB-subgroup14 nucleic acid expression or activity in the plant to generate a transgenic plant cell and regenerating a transgenic plant from the transgenic cell. The compound increases a MYB-subgroup14 sequence expression or activity. The compound can be, e.g., (i) a MYB-subgroup14 polypeptide; (ii) a MYB-subgroup14 nucleic acid and analogs and homologs thereof; (iii) a nucleic acid that increases expression of a MYB-subgroup14 nucleic acid. A nucleic acid that increases expression of a MYB-subgroup14 nucleic acid may include promoters or enhancer elements. The nucleic acid can be either endogenous or exogenous. Preferably, the compound is a MYB-subgroup14 polypeptide or a MYB-subgroup14 nucleic acid. For example the compound comprises the nucleic acid sequence of SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 88, 90, 91, 92, 94, 96, 98, 100, 102, 104, 106, 108, 109, 110, 112, 114, 116, 118, 120, 122, 124, 125, 127, 129, 130, 132, 134, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 200, 202, 204, 206, 208, 210, 211, 213, 214, 216, 217, 218, 220, 222, 224, 226, 228, 230, 232, 234, 236, 238, 240, 242, 244, 246, 247, 249, 251, 253, 255, 257, 259, 261, 263 and 265. Preferably, the compound is a MYB-subgroup14 nucleic acid sequence endogenous to the species being transformed. Alternatively, the compound is a MYB-subgroup14 nucleic acid sequence exogenous to the species being transformed.

An exogenous MYB-subgroup14 sequence expressed in a host species need not be identical to the endogenous MYB-subgroup14 sequence. For example, sequences of three Arabidopsis GAMYB-like genes were obtained on the basis of sequence similarity to GAMYB genes from barley, rice, and L. temulentum. These three Arabidopsis genes were determined to encode transcription factors (AtMYB33, AtMYB65, and AtMYB101) and could substitute for a barley GAMYB and control alpha-amylase expression (Gocal et al. (2001) Plant Physiol. 127: 1682 1693).

Maize, petunia and Arabidopsis MYB transcription factors that regulate flavonoid biosynthesis are very genetically similar and affect the same trait in their native species, therefore sequence and function of these MYB transcription factors correlate with each other in these diverse species (Borevitz et al. (2000) Plant Cell 12: 2383-2394).

Therefore an expressed MYB-subgroup14 need only be functionally recognized in the host cell. Expression of MYB-subgroup14 encoding nucleic acids in Arabidopsis provides the basis of a functionally equivalent assay. For example expression of a MYB-subgroup14 from a Brassica, soybean, cotton or corn source in Arabidopsis and assessment of the heat tolerance demonstrates functional equivalence and provides a sound basis for prediction that the exogenous sequence is a MYB-subgroup14 gene and functions accordingly.

Disclosed herein is a description of expression of MYB-subgroup14 sequences from Arabidopsis, Brassica and soybean that have been demonstrated to be functional in Arabidopsis in that the resulting plants have increased heat tolerance as indicated by reduced flower abortion and increased seed set under heat stress conditions during flowering.

In various aspects the transgenic plant has an altered phenotype as compared to a wild type plant (i.e., untransformed). By altered phenotype is meant that the plant has a one or more characteristic that is different from the wild type plant. For example, when the transgenic plant has been contacted with a compound that increases the expression or activity of a MYB-subgroup14 nucleic acid, the plant has a phenotype such as increased heat tolerance as compared to a wild type plant and manifests this trait in phenotypes such as decreased flower abortion, increased seed set and development, increased yield protection and protection of pollen development and protection of meristems, particularly flower meristems, from heat damage, drought tolerance and salt tolerance for example. Plants with a reduced flower abortion have a 5, 10, 20, 25, 30% or more increase in seed yield as compared to a control plant.

The plant can be any plant type including, for example, species from the genera Arabidopsis, Brassica, Oryza, Zea, Sorghum, Gossypium, Triticum, Glycine, Pisum, Phaseolus, Lycopersicon, Trifolium, Cannabis, Cucurbita, Rosa, Vitis, Juglans, Fragaria, Lotus, Medicago, Onobrychis, Trigonella, Vigna, Citrus, Linum, Geranium, Manihot, Daucus, Raphanus, Sinapis, Atropa, Capsicum, Datura, Hyoscyamus, Nicotiana, Solanum, Petunia, Digitalis, Majorana, Ciahorium, Helianthus, Lactuca, Bromus, Asparagus, Antirrhinum, Heterocallis, Nemesis, Pelargonium, Panieum, Pennisetum, Ranunculus, Senecio, Salpiglossis, Cucumis, Browaalia, Lolium, Avena, Hordeum, Secale, Picea, Caco, and Populus.

Method of Identifying a Heat Stress Tolerant Plant

Also included in the invention is a method of identifying a heat stress tolerant plant. The plants identified by these methods have reduced flower abortion and increased yield as compared to a control plant. Heat stress tolerant plants are identified by exposing a population of flowering plants to a heat stress treatment and selecting a plant from the population of plants that has reduced flower abortion. Heat stress treatment includes for example exposing the plant to a temperature that is hot enough for a sufficient amount of time such that damage to plant functions or development results. By reduced flower abortion is meant that a plant does not loss as many flowers, due to flower abortion, or has a greater seed yield compared to another plant that is exposed to a similar level of heat stress. Plants with a reduced flower abortion have a 5, 10, 20, 25, 30% or more increase in seed yield as compared to a control plant.

EXAMPLES

The invention will be further illustrated in the following non-limiting examples.

Example 1: Identification of Heat Tolerant Mutant

Arabidopsis thaliana var. Columbia was transformed with pSKI15 vector containing a 4×35S enhancer sequence (Wiegel et al., 2000). A T3 population of Arabidopsis seed was obtained from ABRC and used to produce a T4-generation that was used in genetic screen experiments. The Arabidopsis h138 mutant was identified as having reduced or no flower abortion, relative to a wild type control, when exposed to a heat stress during flowering of about 45° C. for about 30 to 60 minutes. Initial isolates were retested by having flowering plants subjected to a 1 hour temperature ramp-up from 22° C. to 45° C. followed by a 2 hour heat stress of 45° C., flower production, seed set and seed development was monitored and heat tolerant lines selected.

Example 2: Identification of the Heat Tolerant MYB68 Gene

Genome walking to localize the T-DNA activation tag insertion was performed as follows. Genomic DNA was purified by phenol:chloroform extraction using 10-day-old seedlings of mutant h138. The isolated DNA was subsequently digested by the restriction enzymes such as EcoRV, PvuII, NruI, or StuI to generate DNA fragments with blunt ends. The resulting fragments from each digestion were ligated to an adaptor that was formed by the annealing of two oligos: Adaptor 1 and Adaptor 2. The addition of the adaptor to the DNA fragments enables PCR amplification using primers specific to the adaptor and a T-DNA insertion site.

Two rounds of PCR were used to generate DNA fragments for further sequencing analysis. Primer HeatL1 (SEQ) that is specific to the T-DNA left border, and primer CAP1 (SEQ) that is specific to the adaptor, were used for the 1^(st) PCR. The resulted PCR products were diluted 50 folds to serve as templates for 2^(nd) PCR. A confirmed DNA fragment was then amplified by two nested primers HeatL2 (SEQ) and CAP2 (SEQ). PCR programs TOUCH1 (6 cycles of 94° C., 25 sec; 72° C., 7 min; 32 cycles of 94° C., 25 sec; 67° C., 7 min and 1 cycle of 67° C., 10 min) and TOUCH2 (4 cycles of 94° C., 25 sec; 72° C., 7 min; 20 cycles of 94° C., 25 sec; 67° C., 7 min and 1 cycle of 67° C., 10 min) were used for the two rounds of PCR. All PCR was carried out using Ex-Taq as DNA polymerase and a Biometra® thermocycler. The PCR products were sequenced, and the flanking genomic sequences identified. The 4×35S enhancers were inserted into an intergenic region that is 5 kb down stream of 3′ end of genomic AtMYB68 (AT5G65790) on chromosome 5. Northern analysis and real-time PCR showed that the expression of MYB68 in h138 was induced to more than 2 fold relative to wild type.

Example 3: Physiological Characterization of the h138 Mutant (Myb68)

Plants were assessed for heat tolerance during flowering and scored based on the number of aborted flowers or pods and final seed yield. Plants were grown in controlled environment chambers where optimal growth conditions were 16 hr light 200 uE and 8 hr dark, 22° C. and 70% relative humidity. Three groups of plants were used in the experimental design; 1) A control group grown under optimal conditions; 2) a 3-hour heat treatment group and; 4) a 4-hour heat treatment group. Heat treatment was performed 6 days after first open flower and the temperature was ramped from 22° C. to 44° C. over a 1-hour period. Each group of plants contained the myb68 mutant and its wild type control (myb68-null) with 10 replicate pots per entry per treatment with each pot containing 5 plants. Plants were assessed for flower abortion a week following the heat stress treatments then left to grow under optimal conditions until maturity. Final seed yield per pot was determined for all 3 groups of plants.

Following heat stress the seed yield of myb68 was lower than that of the myb68-null control in both the 3-hour (25%) and 4-hour (17%) stress treatments however the difference was only statistically significant for the 3-hour treatment. The 3-hour treatment resulted in 32% fewer aborted pods relative to myb68-null and the final seed yield was increased by 16% relative to myb68 plants grown in optimal conditions. The 4-hour treatment also resulted in a 16% increase in seed yield relative to optimally grown myb68 plants. In contrast, the myb68-null showed 15% and 23% reductions in seed yield relative to optimally grown plants. The overall yield protection provided by the myb68 mutation was 31% and 39%, relative to the wild-type. Additional experiments have shown results of yield protection ranging from 5% to 44% depending on the experimental conditions.

Example 4: Constructs Useful for Expression of MYB-Subgroup14 Sequences Including MYB68

According to the methods described below, expression vector constructs can be produced using appropriate promoters and a MYB gene of the invention. For example any of the gene sequences described by the SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 88, 90, 91, 92, 94, 96, 98, 100, 102, 104, 106, 108, 109, 110, 112, 114, 116, 118, 120, 122, 124, 125, 127, 129, 130, 132, 134, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 200, 202, 204, 206, 208, 210, 211, 213, 214, 216, 217, 218, 220, 222, 224, 226, 228, 230, 232, 234, 236, 238, 240, 242, 244, 246, 247, 249, 251, 253, 255, 257, 259, 261, 263 and 265. Such vector constructs are useful to produce a MYB68 gene, operably linked to a sequence that functions as a promoter in a plant cell and to operably express said gene and protein encoded by the gene.

Vectors to over-express MYB68 under regulatory control of either constitutive or conditional promoters may be constructed, as described below. The sequence encoding a MYB68 open reading frame has been operably linked to the promoter sequences of the 35S CaMV constitutive promoter, the P18.2 or P81.1 heat inducible promoters and its endogenous PMYB68 promoter. Additionally the genomic sequence of MYB68 has been cloned behind the 35S CaMV constitutive promoter in a pEGAD vector backbone.

35S-Genonic AtMYB68 (in pEGAD Vector (35S-gAtMYB68)

A 1.4 kb of MYB68 genomic DNA including 83 bps of 3′ UTR was amplified by PCR using primers: MYB68FW-BamH3 (5′-AAAGGATCCATGGGAAGAGCACCGTGTTG-3′) (SEQ ID NO:300) and MYB68RV-BamH4 (5′-AAAGGATCCCCACTCCCTAAAGACACAGATTT-3′) (SEQ ID NO:301), and subsequently digested with BamHI. The resulting DNA fragment was ligated into pBluescript II SK (+/−), and then subcloned into pEGAD at the same site to obtain 35S-gemonicAtMYB68 (35S-gAtMYB68) in pEGAD.

35S AtMYB68, 35S-AtMYB84, 35S-AtMYB36, 35S-AtMYB37, 35S AtMYB38, 35S-AtMYB87

AtMYB84 (At3g49690), AtMYB36 (At5g57620), AtMYB37 (At5g23000), AtMYB38 (At2g36890) and AtMYB87 (At4g37780) are classified as members of the MYB-subgroup14 family along with AtMYB68 (Stracke et al., 2001), therefore it is possible that their functions are redundant. These MYB genes are over-expressed in Arabidopsis to test their functionality as an AtMYB68 orthologue with respect to heat tolerance. The cDNA sequences are amplified by RT-PCR, and cloned into pBI121 without GUS to generate constructs of 35S-AtMYB84, 35S-AtMYB36, 35S-AtMYB37, 35S-AtMYB38 and 35S-AtMYB87.

A 1.1 kb of AtMYB68 cDNA was produced by RT-PCR using primers HG2F (5′-AAATCTAGAATGGGAAGAGCACCGTGTT-3′) (SEQ ID NO:302) and HG2R (5′-AAAGGATCCTTACACATGATTTGGCGCAT-3′) (SEQ ID NO:303), and digested with XbaI and BamHI. The resulting DNA fragment was cloned into pBluescript II SK (+/−), and then into pBI121 without GUS to generate 35S-MYB68. pBI121 without GUS was obtained by SmaI and EcolcR1 double digestion and followed by self-ligation of the remaining vector.

The coding sequence of AtMYB84 (933 bp, AtMYB84, At3g49690) was amplified by RT-PCR using forward primer 690M84-Xba-FW containing an XbaI site (5′-acgt TCTAGA ATG GGA AGA GCA CCG TGT TG-3′) (SEQ ID NO:273) and reverse primer 690M84-Bam-Re containing a BamHI site (5′-atcg GGATCC TTA AAA AAA TTG CTT TGA ATC AGA ATA-3′) (SEQ ID NO:274). The PCR product was cloned at the XbaI-BamHI sites in pBI121, generating construct of 35S-AtMYB84.

The coding sequence of AtMYB36 (1002 bp, AtMYB36, At5g57620) was amplified by RT-PCR using forward primer M36-Xb-FW containing an XbaI site (5′-actg TCTAGA ATG GGA AGA GCT CCA TGC TG-3′) (SEQ ID NO:304) and reverse primer M36-Bm-Re containing a BamHI site (5′-cagt GGATCC TTA AAC ACT GTG GTA GCT CAT C-3′) (SEQ ID NO:305). The PCR product was cloned at the XbaI-BamHI sites in pBI121, generating construct of 35S-AtMYB36.

The coding sequence of AtMYB37 (990 bp, AtMYB37, At5g23000) was amplified by RT-PCR using forward primer AM37-Xb-FW containing an XbaI site (5′-actg TCTAGA ATG GGA AGA GCT CCG TGT TG-3′) (SEQ ID NO:306) and reverse primer AM37-Bm-Re containing a BamH I site (5′-acgt GGATC CTA GGA GTA GAA ATA GGG CAA G-3′) (SEQ ID NO:307). The PCR product was cloned at the XbaI-BamHI sites in pBI121, generating construct of 35S-AtMYB37.

The coding sequence of AtMYB38 (897 bp, AtMYB38, At2g36890) was amplified by RT-PCR using forward primer AM38-Xb-FW containing an XbaI site (5′-actg TCTAGA ATG GGT AGG GCT CCA TGT TGT-3′) (SEQ ID NO:308) and reverse primer AM38-Bm-Re containing a BamH I site (5′-acgt GGATCC TCA GTA GTA CAA CAT GAA CTT ATC-3′) (SEQ ID NO:309). The PCR product was cloned at the XbaI-BamHI sites in pBI121, generating construct of 35S-AtMYB38.

The coding sequence of AtMYB87 (918 bp, AtMYB87, At4g37780) will be amplified by RT-PCR using forward primer M87-Xb-FW containing an XbaI site (5′-aaaa TCTAGA ATG GGA AGA GCA CCG TGC-5′) (SEQ ID NO:310) and reverse primer M87-Bg-Re containing a Bgl2 site (5′-aaaa AGATCT CTA CTC ATT ATC GTA TAG AGG-3′) (SEQ ID NO:311). The PCR product will be cloned at the XbaI-BamHI sites in pBI121, generating construct of 35S-AtMYB87.

P18.2-MYB68, and P81.1-MYB68

The construction involved 4-steps; 1) a 869 bp of Hsp18.2 promoter, and a 406 bp of Hsp81.1 promoter were amplified by PCR using primer sets: HP1F (SEQ ID NO: 277) and HP1R (SEQ ID NO:278), and HP2F (SEQ ID NO:279) and HP2R (SEQ ID NO:280), respectively, and digested with SalI and XbaI. The resulting DNA fragments were cloned into pBI101 at the same sites to generate the new vectors: P18.2pBI101 and P81.1pBI101; 2) a MCS2-oligo (including restriction sites of XbaI, HpaI, AgeIl, KpnI, XhoI, ScaI, SpeI, SalI, BamHI and SmaI) was cloned into the new vectors at XbaI and SmaI sites. The resulting vectors were named P18.2pBI101MCS and P81.1pBI101MCS; 3) the GUS gene was removed by SmaI and EcolcR1 double digestion and followed by self-ligation of the remaining vector to give vectors P18.2pBI101MCS without GUS and P81.1pBI101MCS without GUS; 4) the 1.1 kb of MYB68 cDNA fragment was ligated into the two newer vectors at XbaI and BamHI sites to complete the construction of P18.2pBI101MCS without GUS for P18.2-MYB68, and P81.1MCSpBI121 without GUS for P81.1-MYB68.

pHSP81.1-AtMYB68

The coding sequence of AtMYB68 was isolated by restriction digestion with XbaI and BamHI from plasmid pHSP18.2-AtMYB68, and cloned at the XbaI-BamHI sites in pHSP81.1.

pHPR-AtMYB68

The promoter sequence (−1 to −506 bp, relative to ATG start codon) of the Arabidopsis hydroxy pyruvate reductase gene (HPR, At1g68010) was amplified by PCR from Arabidopsis genomic DNA using a forward primer containing a Sal I site (HPR-Sal-FW, acgt gtcgac GAAGCAGCAGAAGCCTTGAT) (SEQ ID NO:312) and a reverse primer containing an Xba I site (HPR-Xb-R2, acgt tctaga GGT AGA GAA AAG AGA aag cct c) (SEQ ID NO:313). The digested fragment was cloned into the vector pHSP81.1-AtMYB68 that was pre-digested with SalI and XbaI to remove the HSP81.1 promoter. This generates a recombinant plasmid with the HPR promoter placed in front of AtMYB68.

PMYB68-AtMYB68

The AtMYB68 promoter (−1 through −1034 with respect to the MYB68 ATG start codon) was amplified by PCR using primers: Pm68-H3-FW (SEQ ID NO:275) and Pm68-Av-Xh-Re (SEQ ID NO:276), and digested by restriction enzymes: HindIII and XhoI. The resulting promoter fragment was cloned into P81.1MCSpBI121 without GUS at the same sites, replacing the Hsp81.1 promoter. This vector is then named PMYB68pBI121, and used for further cloning of AtMYB68 cDNA (1.1 kb) at Avr II and BamHI sites to obtain PMYB68-AtMYB68. The AvrII-BamHI fragment of MYB cDNA was recovered from the plasmid 18.2-MYB68.

pM68-AtMYB84

The coding sequence of AtMYB84 (933 bp, AtMYB84, At3g49690) was amplified by RT-PCR using forward primer 690M84-Xba-FW containing an XbaI site (5′-acgt TCTAGA ATG GGA AGA GCA CCG TGT TG-3′) (SEQ ID NO:273) and reverse primer 690M84-Bam-Re containing a BamHI site (5′-atcg GGATCC TTA AAA AAA TTG CTT TGA ATC AGA ATA-3′) (SEQ ID NO:274). The PCR product was cloned at the AvRII-BamHI sites in pB-Pm68, generating construct of AtMYB84 under control of the AtMYB68 promoter.

pM68-AtMYB36

The coding sequence of AtMYB36 (1002 bp, At5g57620) was amplified from RNA isolated from young Arabidopsis seedlings (leaves and roots) by RT-PCR using forward primer M36-Xb-FW containing an XbaI site (5′-actg TCTAGA ATG GGA AGA GCT CCA TGC TG-3′) (SEQ ID NO:305) and reverse primer M36-Bm-Re containing a BamHI site (5′-cagt GGATCC TTA AAC ACT GTG GTA GCT CAT C-3′) (SEQ ID NO:306). The PCR product was cloned at the Avr II and BamHI sites in pBI-Pm68 described above. This generated a construct of AtMYB36 under control of the AtMYB68 promoter.

35S-OsMYB36

Rice MYB36 cDNA homolog: The coding sequence (966 bp) of a rice MYB36 gene (SEQ ID NO:9), encoding a protein identified as SEQ ID NO:10 was amplified by RT-PCR from rice root RNA using forward primer rM-Xb-FW2 containing an XbaI site (5′-acgt TCTAGA ATG GGG AGA GCG CCG TGC TG-3′) (SEQ ID NO:314) and reverse primer rM-Bm-Re2 containing a BamH I site (5′-tgca GGATCC CTA CTG CAT CCC GAG GTC AG CT-3′) (SEQ ID NO:315). The PCR product was cloned at the XbaI-BamHI sites in pBI121, generating construct 35S-Os MYB36.

35S-gOs MYB36

Rice MYB genomic homolog clone: Using the same primers described above forward primer rM-Xb-FW2 (SEQ ID NO:314) and reverse primer rM-Bm-Re2 (SEQ ID NO:315), the genomic sequence of the rice MYB36 gene (SEQ ID NO:265) was amplified (1259 bp). The PCR product was cloned at the XbaI-BamHI sites in pBI121, generating construct 35S-gOsMYB36.

35S-GmMYB84

The soybean MYB161 is a homolog of Arabidopsis MYB84. Herein the term ‘soybean MYB84’ is used interchangeably with Soybean MYB161. The 1068 bp coding sequence of a soybean MYB161 was cloned by RT-PCR from soybean root RNA using forward primer soybM-Xba-FW2 containing an XbaI site (5′-acgt TCTAGA ATG GGG AGG GCA CCT TGC T-3′) (SEQ ID NO:316) and reverse primer soybM-Bm-Re containing a BamHI site (5′-acgt GGATC CTA TTG CGC CCC CGG GTA G-3′) (SEQ ID NO:317). The PCR product was cloned at the XbaI-BamHI sites in pBI121, generating construct 35S-GmMYB84.

35S-ZmMYB36

The corn MYB36 cDNA (SEQ ID NO:261) was amplified by PCR using primers: ZmYYBFW-XbaI (5′-aaatctagaATGGGGAGAGCTCCGTGCTGCGACA-3′) (SEQ ID NO:318) and ZmMYBRV-BamHI2 (5′-aaaggatccCTACTTCATCCCAAGGTTTCCTGGC-3′) (SEQ ID NO:319). The DNA fragment was digested by XbaI and BamHI and subsequently ligated to the same sites at pBluescript II SK (+), and then subcloned into the same sites of pBI121 replacing GUS.

35S-GhMYB68 or 35SS-CotMYB68

The cotton MYB68 cDNA was amplified by PCR using primers: CotM-Xb-Fw (5′-acgt TCTAGA ATG GGG AGA GCT CCT TGT TG-3′) (SEQ ID NO:320) and CotM-Bm-Re (5′-acgt GGATCC CTA TTG CGC TCC TCC TGG G-3′) (SEQ ID NO:321). The DNA fragment was digested by XbaI and BamHI. It was ligated to the same sites at pBluescript II SK (+), and then subcloned into the same sites in pBI121 replacing GUS.

35S-BnMYB68r

The canola root MYB cDNA was amplified by PCR using primers: Bn68root-FW-XbaI (5′-aaatctagaATGGGAAGAGCACCGTGTTGTGATAAGGCC-3′) (SEQ ID NO:322) and Bn68root-RV-BamHI (5′-aaaggatccTTACACATTATTTGGCCCATTGAAGTATCTTGC-3′) (SEQ ID NO:323). The DNA fragment was digested by XbaI and BamHI. It was ligated to the same sites at pBluescript II SK (+). The same fragment was then subcloned to the same sites in pBI121 replacing GUS.

35S-BnMYB68b

The canola bud MYB cDNA was amplified by PCR using primers: Bn68Bud-FW-XbaI (5′-aaatctagaATGGGAAGAGCACCGTGTTGTGACAAGGCT-3′) (SEQ ID NO:324) and Bn68Bud-RV-BamHI (5′-aaaggatccTTACAAATGATTTGCCCCATTGAAGTAACTTGC-3′) (SEQ ID NO:325). The DNA fragment was digested by XbaI and BamHI. It was ligated to the same sites at pBluescript II SK (+). The same fragment was then subcloned to the same sites in pBI121 replacing GUS.

Table 2 below describes oligonucleotide primers used to make the vector constructs described above, and additional primers useful for cloning AtMYB homologues.

TABLE 2 Oligonucleotide primers synthesized for cloning AtMYB68 homologues SEQ ID Restriction NO Primer name site Sequence (5′-3′) Remark 271 790M68-Xba- XbaI ACGT TCTAGA ATG GGA AGA AtMYB68 FW GCA CCG TGT TG (at5g65790) 272 790M68-Bam- BamHI ATCG GGATCC TTA CAC ATG ATT AtMYB68 Re TGG CGC ATT G (at5g65790) 273 690M84-Xba- Xba I ACGT TCTAGA ATG GGA AGA AtMYB84 FW GCA CCG TGT TG (at3g49690) 274 690M84-Bam- Bam HI ATCG GGATCC TTA AAA AAA TTG AtMYB84 Re CTT TGA ATC AGA ATA (at3g49690) 275 Pm68-H3-FW Hind III ACGT AAGCTT TCG TAA AAT CTC AtMYB68 TCA TG Promoter 276 Pm68-Av-Xh- Avr II and GTCA CTCGAG CCTAGG TTT CTT AtMYB68 Re Xho I GAT TCT TGA TTC TTG ATC Promoter 277 HP1f AAAGTCGACGCATCTTTACAATGT AAAGCTTTTCT 278 HP1R AAATCTAGATGTTCGTTGCTTTTC GGG 279 HP2F AAAGTCGACAGAAGACAAATGAG AGTTGGTTTATATTT 280 HP2R AAATCTAGACGCAACGAACTTTG ATTCAA 281 BnMYB68FW2 ATGGGAAGAGCACCGTGTTGTGA Canola MYB68 TAAGGCC (AC189266.1) 282 BnMYB68RV2 TTAATTTGGCGCATTGAAGTAACT Canola MYB68 TGCATCTTCGG (AC189266.1) 283 rM-Xb-FW Xba I ACGT TCTAGA ATG GGG AGA Rice MYB GCG CCG TGC (AAT85046) 284 rM-Bm-Re BamH I TGCA GGATC CTA CTG CAT CCC Rice MYB GAG GTC AG (AAT85046) 285 cotM-Xb-FW Xba I ACGT TCTAGA ATG GGG AGA Cotton MYB GCT CCT TGT TG (TC34239) 286 cotM-Bm-Re BamH I ACGT GGATCC CTA TTG CGC TCC TCC TGG G 287 soybM-Xba- XbaI ACGT TCTAGA ATG GGG AGG Soybean MYB FW2 GCA CCT TGC T (ABH02906) 288 cornM-Xba- XbaI I ACGT TCTAGA ATG GGG AGA Corn MYB FW2 GCT CCG TGC T (TC370133) 289 wheatM-Xba- XbaI ACTG TCTAGA ATG GGG AGG Wheat MYB FW GCG CCG TGC (BQ483726) 290 MedtM-Xba- XbaI ACTG TCTAGA ATG GGA AGA M. truncatula FW GCT CCT TGC TGT MYB (TC97441) 291 sorgM-Xb-FW XbaI ACGTTCTAGA ATGGGGAGAG sorghum MYB CTCCGTGCT (AAL84760) 292 toM-Xb-FW XbaI ACTGTCTAGAATGGGAAGAGCTC Tomato blind CATGTTGT (AAL69334) 293 toM-Bm-Re BamtII GACT GGATCC TTA GTA ATA AAA CAT CCC TAT CTC A 294 popM-Xb-FW Xba I ACGT TCTAGA ATG GGG AGA Poplar MYB GCT CCT TGC TG (TC54478) 295 popM-Bm-Re BamtII GACT GGATCC TCA TTG TGG CCC Poplar MYB AAA GAA GCT (TC54478) 296 HSP18.2 HP1F AAAGTCGACGCATCTTTACAATGT AAAGCTTTTCT 297 HSP18.2 HP1R AAATCTAGATGTTCGTTGCTTTTC GGG 298 HSP81.1 HP2F AAAGTCGACAGAAGACAAATGAG AGTTGGTTTATATTT 299 HSP81.1 HP2R AAATCTAGACGCAACGAACTTTG ATTCAA Note: 1. FW: Forward primer with gene specific sequence starting from the ATG start codon. 2. Re: Reverse primer with gene specific sequence starting at the stop codon. 3. Restriction sites at the 5′ end are underlined.

The expression vector constructs of the invention can be introduced into Arabidopsis, the plant of origin or any species of choice. For example an Arabidopsis MYB gene may be over expressed in a Brassica species or alternatively a soybean, maize, rice or cotton species.

Example 5: Amino Acid Sequence Analysis of MYB68

The tables below provide a comparison of amino acid sequences from the MYB gene family across different plant species, under different settings for multiple sequence alignment and amino acid sequence analysis. Note: Different MYB naming and numbering systems are used in the literature and databases for different plant species. In the tables below, (*) indicates the predicted ORF genomic DNA sequence was edited according to peptide sequence alignments to generate a putative coding sequence sequence. The (P) designation indicates a partial sequence. Sequence homology and multiple sequence alignments were compared by ClustalW.

TABLE 3 Protein Multi-alignment scores to Sequence size AtMYBs (%, Clustal) Species Name file (a.a.) MYB68 MYB84 MYB36 AtMYB68 100 AtMYB84 At3g49690 310 64 100 SEQ ID NO: 4 AtMYB36 At5g57650 333 35 39 100 SEQ ID NO: 6 Canola AC189266 364 (*) 88 59 33 SEQ ID NO: 8 Rice Rice MYB 360 34 36 s8137 Rice MYB AAT85046 321 39 40 39 s3656 SEQ ID NO: 10 Soybean Soybean ABH02831 259 21 20 MYB84 Soybean ABH02839 317 22 20 MYB84 Soybean ABH02906 198 (P) 63 65 58 MYB161 SEQ ID NO: 14 Soybean ABH02912 209 (P) 58 56 MYB71 Corn TC32080 361 33 38 TC32080 131 79 80 ZmMYB- AF099429 43 (P) 86 86 IP30 ZmMYB- TC370133 131 (P) 82 83 83 IP30 SEQ ID NO: 18 ZmMYB- AF099383 43 (P) 81 81 HX43 Corn CAJ42201 226 33 35 MYB8 Corn CAJ42202 275 26 26 MYB31 Corn P20025 255 30 30 MYB38 Cotton Cotton TC34239 356 40 41 34 MYB SEQ ID NO: 12 Cotton AAK19616 309 32 27 GHMYB25 Cotton AAK19619 264 28 25 GHMYB9 Cotton AAZ83352 307 31 29 GHMYB30 Sorghum Sorghum AAL90639 87 (P) 60 62 MYB68 Sorghum AAQ54875 157 (P) 69 70 MYB86 Sorghum AAL84760 157 (P) 69 70 75 MYB20 SEQ ID NO: 20 Sorghum AAL84761 203 46 40 MYB34 Sbi_042749 318 38 37 203 55 53 157 63 63 Medicago ABE78637 336 28 31 truncatula ABE90877 319 24 24 TC97441 178 (P) 70 70 66 SEQ ID NO: 26 Tomato Blind AAL69334 315 39 SEQ ID NO: 28 EST467561 237 51 51 SEQ ID NO: 30 TC182203 185 (P) 60 54 64 AAL69334 185 62 58 62 EST467561 185 61 58 59 Wheat BQ483726 175 (P) 66 65 70 SEQ ID NO: 22 Poplar TC54478 345 39 44 38 SEQ ID NO: 24

TABLE 4 ATMYB68 Homologues from Different Crops/Species Protein Multi-alignment scores to AtMYBs Sequence size (%, ClustalW) Name file (a.a.) AtMYB68 AtMYB84 AtMYB36 AtMYB68 At5g65790 374 100 SEQ ID NO: 2 AtMYB84 At3g49690 310 64 100 AtMYB36 At5g57620 333 35 39 100 Canola AC189266 364 ((*)) 88 59 33 Soybean ABH02906 198 (P) 63 65 58 Cotton TC34239 356 40 41 34 Tomato Blind 315 39 AAL69334 Medicago TC97441 178 (P) 70 70 66 truncatula Rice AAT85046 321 39 40 38 Corn TC370133 131 (P) 82 83 83 Wheat BQ483726 175 (P) 66 65 70 Sorghum AAL84760 157 (P) 69 70 75 Poplar TC54478 345 39 44 38

In Table 4 above, sequence homology and multiple sequence alignments were compared by ClustalW at http://www.ebi.ac.uk/clustalw/ with the following default settings:

Matrix: Gonnet 250

GAP OPEN: 10.0

END GAPS: −1

GAP EXTENSION: 0.2

GAP DISTANCES: 4

TABLE 5 ATMYB68 Homologues from Different Crops/Species Sequence homology Protein size (%) to AtMYB68 Species Sequence file (a.a.) Protein DNA AtMYB68 At5g65790 374 100 100 SEQ ID NO: 2 AtMYB84 At3g49690 310 64 68 AtMYB36 At5g57620 333 37 29 Canola AC189266 364 ((*)) 88 90 Soybean ABH02906 198 (P) 63 53 Cotton TC34239 356 40 30 Tomato Blind 315 39 28 (AAL69334) Medicago TC97441 178 (P) 70 58 truncatula Rice AAT85046 321 39 28 Corn TC370133 131 (P) 82 67 Wheat BQ483726 175 (P) 66 53 Sorghum AAL84760 157 (P) 69 59 Poplar TC54478 345 39 30

In Table 5 above, sequence homology and multiple sequence alignments were compared by ClustalW at http://www.ebi.ac.uk/clustalw/ with the following default settings:

Matrix: Protein: Gonnet 250

GAP OPEN: DNA: 15.0 Protein: 10.0

END GAPS: −1

GAP EXTENSION: DNA: 6.66 Protein: 0.2

GAP DISTANCES: 4

Example 6: Physiological Characterization of the 35S-MYB68 Expression Lines

Within a population of transgenic lines a gradation of expression levels and physiological response will exist. In part, the gradation of variation is a result of the site of integration of the gene construct and the local environment for gene expression at that locus. Therefore, lines must be screened and evaluated in order to select the best performing lines. This process is one of routine to one skilled in the art.

Homozygous lines expressing a 35S-MYB68 expression construct have been evaluated in a heat and flower abortion experiment. The experimental set up included 8 replicate plants per line with 1 plant per 3″ pot and grown under optimal conditions in a controlled environment chamber (18 hr light at 200 uE, 6 hr dark, 22° C., 70% relative humidity). Three days after the appearance of the first flower, plants were exposed to a heat shock of 1 hour at 42° C. and returned to optimal conditions for a further 7 days. Plants were assessed for flower abortion on the main stem. Lines were identified that demonstrated reduced flower abortion rates from 34% to 60% relative to wild type controls.

Seed yields and yield protection, expressed as a percent relative to wild type, was determined for nine independent 35S-MYB68 transgenic lines. The experimental set up included 3 plants per 3″ pot which were grown as above with 22 replicates per line. Three days after the appearance of the first flower, 12 replicates per line were exposed to a 3-hour heat stress at 45° C. with a 1 hour ramp up from 22° C. Three days later the heat stress was applied again. The remaining 10 replicates the plants per line were maintained under optimal conditions throughout their life cycle. The final seed yield was determined for all the plants. Wild type plants showed a 40% reduction in yield due to the applied heat stress whereas transgenic lines, while still having a reduced yield due to heat stress the reduction was less severe resulting in a 10% to 12% yield protection.

Selected lines were re-evaluated and stressed as follows. Plants were exposed to a 1-hour ramp up period from 22° C. to 45° C. and a heat stress of 45° C. for 1.5 to 1.8 hours was maintained. Heat stress was applied daily for five consecutive days followed by a five day recovery period and then a sixth heat treatment. The heat treatment resulted in 11% reduction in yield in WT plants and in four of the transgenic lines a yield increase was observed ranging from 2% to 17%. As shown in Table 6, six transgenic lines (68, 80, 93, 73, 83, 30) showed yield protection relative to WT following the heat stress. This protection ranged from 3 to 31%. Two of these transgenic lines (30 and 73) have been shown to have yield protection in the previous experiment and two lines (93 and 83) showed reduction in flower abortion.

TABLE 6 Yield protection Line Heat seed yield relative to WT 68 0.816 31% 80 0.714 26% 93 0.781 17% 73 0.820 14% 83 0.719  5% 30 0.801  3% WT 0.784 — myb68 0.577 15% WT (myb68- 0.779 — null) Characterization of Arabidopsis Lines Expressing the PMYB68-AtMYB68 Construct

Fourteen homozygous transgenic lines at T3 were evaluated in a flower abortion experiment. Plants (fourteen replicates per entry) were grown under optimal conditions (18 hr light, 200 uE, 22 C, 60% RH) in 2.25″ pots until three days from first open flower. At that point plants were exposed to 1 hr treatment at 43 C. Following the heat treatment plants were returned to optimal conditions for 7 more days. The impact of heat treatment on the pod formation was assessed at that point by counting the aborted and damaged (short) pods in the region that was exposed to the heat stress. Six transgenic lines showed a reduction in the total number of damaged pods as compared to controls (wild type-Colombia and the null-n11-8) (see Table 7 below). The best line showed only 70% of the heat damage as the control. Two of the transgenic lines examined here also showed reduced flower abortion at the T2 stage of screening. The myb68 mutant, included as a positive control also showed a reduction in the total number of damaged pods as compared to its segregating null control (myb68-null).

TABLE 7 # of damaged pods total damaged pods Entry Mean Std Err as % of col 15-8 2.3 0.4 70%  3-3 2.4 0.4 72%  4-4 2.9 0.4 87%  8-1 2.9 0.4 89% 27-1 3.1 0.4 93% 20-4 3.2 0.4 98% n-11-8 4.1 0.4 126%  col 3.3 0.4 100%  myb68 1.9 0.5 60% myb68-null 3.2 0.4 100% 

Example 7: Physiological Assessment of Transgenic Plants Expressing MYB68

Arabidopsis Myb68 Mutant Shows Drought Tolerance

Five plants per 3″ pot with 6 replicates per entry were grown under optimal conditions of 16 hr light (180 uE), 22 C, 70% RH in a growth chamber until first open flower. A drought treatment was started equalizing the amount of water in all pots and cessation of watering. Water loss was measured daily by weighing the pots. Soil water content (SWC) was calculated as a % of initial. Plants were harvested on days 0, 2 and 4 of the drought treatment. Water lost relative to shoot dry biomass of the plants was calculated. The myb68 plants show trends towards greater SWC than that of control throughout the drought treatment. Shoot biomass was not different or slightly larger than that of control. The ratio of water lost in 2 days over shoot dry weight on day 2 was lower for the myb68 mutant than control indicating trends towards drought tolerance.

TABLE 8 Soil water content (% of initial) on days of the drought treatment and water lost in 2 days relative to shoot dry weight on day 2. SWC-d2 SWC-d4 Water lost in 2d/ (% Initial) (% Initial) Shoot DW-d2 Entry Mean S.E. Mean S.E. Mean S.E. myb68 36.3 1.0 9.1 0.2 161.6 4.9 WT 32.3 0.8 8.5 0.2 185.3 8.6 Myb68 Mutant and 35S-gMYB68 and 35S-MYB68 Arabidopsis Transgenic Lines Show Salt Tolerance at Seedling Stage

Seeds were sterilized and placed on agar plates with ½ MS growth media containing salt (200 mM NaCl) or no salt (optimal plates) with 6 plates per entry and 30 seeds per plate. After 3 days at 4 C plates were placed in the growth room at 22 C, and 18 hr lights (100 uE) for 16 days. After 7 days plates were scored for germination and after additional 9 days seedlings were scored for bleaching (% white seedling indicative of stress). No differences were found between controls and transgenic lines or the mutant in germination on optimal plates and on salt plates. But after 16 days of salt exposure seedlings showed signs of stress by becoming bleached. Results indicated that the myb68 mutant and the transgenic 35S-Myb68 expressing lines had fewer bleached seedlings which are indicative of lower sensitivity to salt stress.

TABLE 9 Bleached seedlings (% of white seedling) scored after 16 days on 200 mM NaCl containing agar and ½ MS plates. % White Seedlings construct Entry Mean Std Err myb68 (mutant) myb68 51.3 6.5 myb68- 60.1 7.0 null 35S-gMYB68 30 29.3 5.0 73 29.8 4.7 control 41.6 4.6 35S-MYB68 104-3 56.7 4.9  22-7 62.1 5.4 Constitutive Expression of MYB68 in Arabidopsis Results in Yield Protection Following Heat Stress.

The 35S-Myb68 Arabidopsis plants were grown in 3″ pots with 3 plants per pot and 10 replicates per optimal treatment and 12 replicates per heat treatment. All plants were grown under optimal conditions until 2 days into flowering. At that point optimal plants remained in optimal conditions (22 C, 18 hr light of 200 uE, 70% RH) and the test group had a daily heat treatment applied by increasing temperature from 22° C. to 45° C. over a 1 hr ramp period and maintaining that temperature for 1.5 to 2.5 hr for five consecutive days. Plants recovered for a two day period without applied stress after which stress was applied again for an additional three days (total of eight days of heat treatment). Following the heat treatments plants were maintained in optimal conditions till maturity together with the optimal group and final seed yield of both groups was determined. The results indicate that four 35S-Myb68 transgenic lines (22-7, 20-11, 35-1 and 8-6) showed yield protection following the heat stress treatments that ranged from 8 to 21% relative to controls.

TABLE 10 Seed yield per pot from plants grown under optimal and heat stress conditions as described above. Yield protection was calculated as the difference between the seed yield following stress and expressed as % of optimal in transgenic lines and col. yield optimal heat Protection seed yield (g) seed yield (g) % of relative to Entry n Mean Std Err n Mean Std Err opt col 22-7 8 0.449 0.034 12 0.470 0.018 105%  21%  20-11 10 0.194 0.025 11 0.201 0.014 104%  20%  35-1 8 0.500 0.033 12 0.465 0.023 93% 10%   8-6 9 0.517 0.028 12 0.473 0.011 91% 8% 28-12 10 0.572 0.025 12 0.503 0.012 88% 4% 33-3 8 0.155 0.016 11 0.133 0.012 86% 2% 25-6 10 0.395 0.027 12 0.335 0.013 85% 1% 20-7 10 0.571 0.021 12 0.484 0.010 85% (null) 33-10 10 0.546 0.018 12 0.456 0.016 84% (null) Col 10 0.548 0.029 12 0.457 0.020 83%

Example 8: Functional Confirmation of Arabidopsis MYB-Subgroup14 Sequences and Homologues from Other Species Produce the Desired Phenotypes Such as Heat Tolerance

Constitutive Expression of BnMYB68 in Arabidopsis Results in Reduced Flower Abortion Following Heat Stress.

Two closely related Myb68 sequences were identified from Brassica. Their expression patterns differ in that one is expressed predominately in the roots (SEQ ID NO:54), the other in flower buds (SEQ ID NO:56). Plants having constructs expressing either of the BnMyb68 were produced and evaluated. Plants were grown in 2.25″ pots (1/pot) under optimal conditions (22 C, 50% RH, 17 hr light of 200 uE) until 3 days from first open flower. Plants were transferred from 22 C to 43 C for 2-2.5 hr (see tables below). Following this heat stress plants were returned to optimal conditions at 22 C for a week. One week following the heat stress number of aborted flowers was counted. Transgenic lines of 35S-BnMyb68(root) construct showed fewer aborted pods than its control (null). Two transgenic lines of 35S-BnMyb68(bud) construct showed reduced flower abortion following heat stress than their control (null line).

TABLE 11 Number of aborted flowers following 2 hr heat stress at 43 C.-35S-BnMYB68 (root) # # Aborted pods Entry Reps Mean Std Err % Null 36-1 10 6.6 0.9 79% 49-1 12 6.6 0.9 79% 61-4 11 6.8 0.7 82% 70-2 11 7.5 1.1 89% 73-3 12 7.5 0.7 90% 75-7 11 5.9 1.0 71% 78-8 11 6.5 0.9 77% 80-8 12 6.5 0.9 78% null 75-5 11 8.4 0.9

TABLE 12 Number of aborted flowers following 2.5 hr heat stress at 43 C.-35S-BnMYB68 (bud) # # Aborted pods Entry Reps Mean Std Err % Null 11-6 11 4.6 0.4 73% 61-12 12 5.8 0.8 92% Null 97-1 12 6.3 0.6 Constitutive Expression of Soybean MYB84 or Arabidopsis MYB84 or Arabidopsis MYB36 in Arabidopsis Results in Reduced Flower Abortion Following Heat Stress.

Over-expression constructs of soybean-Myb84 (GmMyb84) or Arabidopsis Myb84 (AtMyb84) or Arabidopsis Myb36 (AtMyb36) were made and transformed into Arabidopsis plants functionally confirm the Myb-subgroup14 homologues resulted in heat tolerance as demonstrated by reduced flower abortion under heat stress. Transgenic plants were produced and the T2 generation was used for an initial screen of heat tolerance. Subsequently, T3 homozygous transgenic plants are used for detailed physiological assessment and confirmation of initial results.

The T2 seeds were plated on 0.5× MS agar plates with vitamins (1 plate/flat). Each test group included a positive control, the original heat tolerant mutant myb68, and a corresponding wild type. Seedlings were transplanted to soil on day ten post germination. At the early flowering stage, plants were placed into a heating chamber at 45° C., 65% humidity) for 24-30 minutes. The plants were then placed back into the growing chamber under normal growth conditions (17 h light/7 h dark, 200 μE, 22° C., 70% humidity). Plants were examined on day thirty-two and scored for aborted siliques, partially aborted siliques, dead meristems or normal siliques. A heat stress was deemed effective if a majority of wild type plants had significant flower abortion Transgenic lines were assessed for gene expression by RT-PCR and demonstrated to have elevated expression levels.

Constructs and transgenic plants are produced using homologous of Myb-subgroup14 sequences from desired crop species, for example, rice, corn, wheat, soybean and cotton and evaluated for heat tolerance.

TABLE 13 flower abortion as % Construct of control 35S-Bn MYB68-bud 84 35S-Bn-MYB68-root 82 35S-GmMYB84 58 35S-AtMYB84 77 35S-AtMYB36 81

This example demonstrates that Arabidopsis can be used as a model system to assess and provide conformation that a Mub-subgroup14 sequence can provide heat tolerance and that sequences identified from other plant species are functional in Arabidopsis.

Characterization of Arabidopsis Lines Expressing a 35S-AtMYB36 Construct

Fifteen homozygous transgenic lines at T3 were evaluated in a flower abortion experiment. Plants (fourteen replicates per entry) were grown under optimal conditions (18 hr light, 200 uE, 22 C, 60% RH) in 2.25″ pots until four days from first open flower. At that point plants were exposed to 1 hr treatment at 43 C. Following the heat treatment plants were returned to optimal conditions for 7 more days. The impact of heat treatment on the pod formation was assessed at that point by counting the aborted and damaged (short) pods in the region that was exposed to the heat stress. Nine transgenic lines showed reduction in the total number of damaged pods as compared to controls (wild type-Columbia and the null-n77-4) (see table below). Three of the lines examined here also showed reduced flower abortion at the T2 stage of screening. The myb68 mutant, included as a positive control also showed a reduction in the total number of damaged pods as compared to its segregating null control (myb68-null).

TABLE 14 total # of damaged pods total damaged pods entry Mean Std Err as % of col 108-2 3.2 0.6 65%  14-3 3.7 0.5 76%  36-2 3.8 0.4 78% 103-7 3.8 0.4 78%  43-6 3.8 0.4 78%  23-4 3.9 0.4 81%  67-7 4.0 0.4 82%  82-7 4.4 0.4 90%  40-4 4.5 0.6 93% n77-4 4.1 0.5 85% col 4.9 0.6 100%  myb68 3.1 0.4 72% myb68-null 4.3 0.5 100%  Constitutive or Inducible Expression in Brassica napus of an Arabidopsis MYB68 Shows Reduced Flower Abortion Following Heat Stress

Five transgenic lines having an Arabidopsis Myb68 gene sequence under the control of a constitutive promoter or a heat inducible promoter were evaluated for heat stress tolerance under growth chamber conditions. These lines were at the T2 stage and were heterozygous, with the exception of the 01-105G-1-E line. Analysis of heterozygous lines typically produces greater variation than the same analysis performed on homozygous lines. However, early analysis allows for screening and subsequent analysis of the derived homozygous lines. Segregating nulls and the parent DH12075 were included as controls. The experiment was arranged in a split-plot design with temperature as main factor and transgenic line as subfactor. The plants were grown in 15 cm plastic pots filled with “Sunshine Mix #3” under approximately 500 μmol m⁻² s⁻¹ photosynthetically active radiation at the top of the crop canopy. Molecular analysis was performed to confirm transgene presence. Two groups of plants were included in the test; one group was grown under optimum conditions (22/18° C. day/night temperature, 16-h photoperiod) throughout the growing period, while the second group was subjected to heat stress at 31° C. for 5 hr. per day (ramped-up from 18 to 31° C., then back down to 18° C.). Heat stress conditions were initiated on the third day following the first flower opening and for seven days thereafter. The heat-stressed plants were then returned to optimum conditions until maturity. All racemes on the stressed plants were marked at the beginning and end of the heat stress period, to indicate which flowers had been subjected to heat stress. Viable and aborted pods on the marked racemes were counted and the pod abortion rate calculated as the ratio of aborted pods to aborted plus viable pods under heat-stress conditions, expressed as percent. Seed yield was determined after harvest.

There were significant differences in flower abortion among different lines, with two 35S-Myb68 lines showing significantly reduced abortion rate under heat stress compared to the parental line. Line 02-104G-4-A had a significantly lower abortion rate (33%) than its segregating null (48%) and DH12075 (61%). The abortion rate for line 02-104G-3-K (47%) was also significantly lower than that of the segregating null (66%) and DH12075. The homozygous line (01-105G-1-E) had a slightly lower abortion rate (53%) than its segregating null (58%) and the parental control. Within a population of transgenic lines a gradation of expression levels and physiological response will exist. In part, the gradation of variation is a result of the site of integration of the gene construct and the local environment for gene expression at that locus. This gradation is expected and therefore, lines must be screened and evaluated in order to select the best performing lines. This process is one of routine to one skilled in the art.

Seed yield differed among the tested lines. The yield of three transgenic lines, 02-104G-3-K, 02-104G-4-A and 01-105G-1-E, was similar to that of the parental line, however compared to it's own null the three lines showed between 10% and 22% increase in seed yield. Two transgenic lines (02-33H-1-V and 01-105G-3-G) appeared to under perform compared to the DH12075 parent however, compared to appropriate nulls, one line was significantly lower. Due to the zygosity of these lines such variability is not unexpected.

A similar trend was found for seed number per raceme under heat stress conditions. Lines 02-104G-3-K, 02-104G-4-A and 01-105G-1-E had a slightly higher seed number per raceme but the other transgenic lines (02-33H-1-V and 01-105G-3-G) had significantly lower seed number per raceme compared the parent line. There were no significant differences in 100 seed weight among the tested lines.

In general, two transgenic lines expressing the Arabidopsis Myb68 gene demonstrated significant protection against flower abortion during heat stress imposed at flowering. Additionally, three transgenic lines indicated a trend of increased seed yield.

TABLE 15 abortion Ab % Yield Construct Line rate % S.E. null Yield S.E. % Null 35S-MYB68 02-104G-3-K 47 4.8 71 0.71 0.188 122 02-104G-3-G null 66 3.7 0.58 0.091 02-104G-4-A 33 7.6 69 0.75 0.153 110 02-104G-4-F null 48 2.8 0.68 0.090 02-33H-1-V 79 1.9 116 0.31 0.159 44 02-33H-1-F null 68 4.3 0.71 0.090 P18.2-MYB68 01-105G-1-E 53 4.1 91 0.73 0.097 116 01-105G-1-B null 58 3.0 0.63 0.090 01-105G-3-G 75 4.9 94 0.31 0.099 96 01-105G-3-J null 80 3.6 0.32 0.097 Control DH12075 parent 61 2.4 0.62 0.106

Example 9: Identification of MYB-Subgroup14 Sequences and Homologues

Methods for identification of Arabidopsis MYB sequences, classification of MYB sequences into designated subgroups and identification of MYB-subgroup14 sequences are further described in Stracke et al., 2001 and Kranz et al., 1998. MYB-subgroup14 sequences have been is defined as a nucleotide or protein sequence comprising an Arabidopsis the characteristics described herein. The MYB-subgroup14 is a R2R3 MYB sequence that additionally comprises a conserved motif or motifs as described by the following patterns.

The general pattern (SEQ ID NO:266) provides a sequence that can be used to identify a candidate MYB-subgroup14 sequence. At some positions multiple amino acid residues are permitted at a given location. Where multiple amino acids are permitted, the optional residues are indicated within square brackets. Where a R2R3 MYB protein sequence fits the general pattern it is likely to be a MYB-subgroup14 sequence. A MYB-subgroup14 sequence may be less than identical to the general pattern, for example it may be 90%, 95% or 99% identical.

If a candidate MYB-subgroup14 sequence, matching the general pattern further includes a match to the exclusive pattern (SEQ ID NO:267) then the sequence is a strong candidate for inclusion as a MYB-subgroup14 sequence. The exclusive sequence (SEQ ID NO:267) defines the pattern of amino acids that are present in MYB-subgroup14 sequences but may differ in other R2R3 MYB proteins. Presence of the exclusive pattern within a MYB protein is a strong indicator that the MYB is a member of the MYB-subgroup14 family.

If a candidate MYB-subgroup14 sequence, matching the general pattern further includes a match to the absolute pattern (SEQ ID NO:268) then the sequence is a strong candidate for inclusion as a MYB-subgroup 14 sequence. The Absolute pattern (SEQ ID NO:268) represents sequence residues present in all MYB-subgroup14 sequences analyzed to date.

For the general, exclusive, and absolute patterns (SEQ ID NOs:266-268) listed below, “X” denotes any amino acid, “X(N)”, where N is any number, denotes a string of the indicated number of “X”s, (i.e., X(23) denotes a string of 23 “X”s), where X is any amino acid. At some positions multiple amino acid residues are permitted at a given location. Where multiple amino acids are permitted, the optional residues are indicated within square brackets.

General Pattern (SEQ ID NO: 266) M-G-R-X-P-C-C-D-[KR]-X-X-[MV]-K-[RK]-G-X W-[SA]-X- [DQE]-E-D-X-X-[IL]-[RK]-X-[FY]-X-X-X-X-G-X-X-X- [SN]-W-I-X-X-P-X-[RK]-X-G-[IL]-X-R-C-G-[KR]-S-C-R- L-R-W-[IL]-N-Y-L-R-P-X-[IL]-[RK]-H-G-X-[FY]-[ST]- X-X-E-[DE]-X-X-[IV]-X-X-X-[FY]-X-X-X-G-S-[KR]-W-S- X-[MI]-A-X-X-[ML]-X-X-R-T-D-N-D-[ILV]-K-N-[HY]-W- [DN]-[ST]-[RK]-L-[RK]-[RK]-[RK] Exclusive Pattern (SEQ ID NO: 267) [RK]-X(9)G-X-X-I-X(28)-H-X(14)-[YF]-X(4)-S-X(23)- [RK] Absolute Pattern (SEQ ID NO: 268) G-X-W-X-X-X-E-D-X-X-[IL]-[RK]-X-X-X-X-X-X-G-X(23)- R-W-[IL]-N-Y-L-R-P-X-[IL]-[RK]-H-G-X-[FY]-X-X-X-E- [DE]-X(13)-W-X-X-X-A-X-X-X-X-X-R-T

MYB-subgroup14 sequences can be defined by the consensus sequence of SEQ ID NO:266. Positions have been identified in which the amino acid residues are found exclusively or predominately in the MYB-subgroup14 sequences. In the following description all position numbers are in reference to Arabidopsis MYB68 protein (SEQ ID NO:2) which correlates to the general consensus sequence above, SEQ ID NO:266.

Within the R2 domain at position 26, a positively charged (K or R) residue is conserved in MYB-subgroup14. Although this amino acid is not exclusive to this subgroup at this position, the tendency for the rest of the Arabidopsis MYB proteins is for a hydrophobic residue, with over 50% of all MYB proteins having a hydrophobic isoleucine or valine residue (Stracke et al., 2001).

At position 36, all members of subgroup 14 contain an insertion, resulting in an extra amino acid residue. This appears to be exclusive to MYB-subgroup14. Glycine (G), a small hydrophobic residue, is the extra residue in all cases except in MYB87 (SEQ ID NO:32), which contains an asparagine residue at this position and a rice homologue (SEQ ID NO:194) which contains an argentine at this position.

At position 39, members of the MYB-subgroup14 predominantly contain a hydrophobic isoleucine residue. One exception to this is a rice homologue identified as SEQ ID NO:191, which possesses a glutamine at this position. Although other hydrophobic residues are generally found in MYB proteins at this position, isoleucine appears to be exclusive to MYB-subgroup14. Additionally, positively charged residues (39% R), and polar residues (23% N) are most prevalent at this position.

Within the R3 domain at position 68, all MYB-subgroup14 members contain the positively charged histadine residue. A positively charged residue appears at this position in all MYB proteins; however in contrast to the MYB-subgroup14, in other MYB proteins 73% contain an arginine (R) and 17% contain a lysine residue.

At position 83, most members of MYB-subgroup14 contain an aromatic hydrophobic residue (tyrosine Y, or phenylalanine F). The appearance of an aromatic hydrophobic residue at this position seems to be exclusive to MYB-subgroup14. The majority of MYB proteins have a histidine residue (87%). This histidine has been suggested to be a crucial residue in the hydrophobic core of the helix (Ogata et al., 1992). Through NMR analysis, it appears to be in contact with two of the critical tryptophan residues. The absence of a histidine at position 83 does not necessarily exclude it as a member of MYB-subgroup14 as members have been identified that do possess a histidine residue, for example SEQ ID NO:169 and SEQ ID NO:225.

At position 88, members of MYB-subgroup14 contain the polar serine residue, while almost all other MYB proteins (91%) contain the polar asparagine residue. A serine residue appears to be exclusive to this subgroup. The absence of a serine at position 88 does not necessarily exclude it as a member of MYB-subgroup14 as at least one MYB-subgroup14 member has been identified that possesses a phenylalanine residue in position 88, for example SEQ ID NO:256.

At position 112, members of MYB-subgroup14 contain a positively charged arginine or lysine residue. The majority of MYB proteins also contain positively charged residues at this position, however, histadine (48%) is the most prevalent residue found. The absence of a arginine at position 112 does not necessarily exclude it as a member of MYB-subgroup14 as at least one MYB-subgroup14 member has been identified that possesses a glutamic acid residue in position 112, for example SEQ ID NO:68 contains a glutamic acid residue and SEQ ID NO:191 contains a threonine.

Variation within a R2R3 domain is permissible as shown in the identified consensus sequences. A R2R3 domain of a MYB-subgroup14 sequence may be 90% homologous, preferably 95% homologous or more preferably 99% homologous to the consensus sequence presented.

The Addition of S1 and S2 Motifs

Within MYB-subgroup14 the MYB68 and MYB84 sequences contain two further conserved motifs identified as S1 (SFSQLLLDPN) (SEQ ID NO:269) and S2 (TSTSADQSTISWEDI) (SEQ ID NO:270). These motifs are found in both Arabidopsis and Brassica MYB68, MYB36 and MYB84 sequences and show at least 70% homology within the amino acid sequence. Additionally, homology to the S1 or S2 motifs was found to exist in orthologs in Brassica napus (Canola), Brassica rapa (Cabbage), Brassica oleracea, Raphanus raphanistrum (Radish), and homology to the S2 region in Poncirus trifoliate (Orange), and weak homology in a Medicago trunculata homologue and Vitis vinifera (Grape) homologue within the S2 region.

For inclusion as a S1 or S2 motif target sequences show homology of at least 70%, more preferably 80% and most preferably 95%.

The MYB68 and MYB84 sequences from species other than Arabidopsis and Brassica may not contain a 51 and S2 motif but may still be classified as a MYB subgroup-14 sequence based on sequence analysis and inclusion of other criteria.

Identification of MYB-Subgroup14 Members, Including MYB68, Homologues

Homologues of an Arabidopsis MYB-subgroup14 sequence (Table 1) or a desired MYB68 (SEQ ID NO:1, SEQ ID NO:2), can be found using a variety of public or commercial software that is known to those skilled in the art. Blast alignments can be performed and putative sequences identified. Searches can be performed as outlined herein. The top homologues are determined using programs such as tblastn, tblastp, searches against available databases in NCBI, such as the EST, GSS, HTG and chromosomal databases, as well as other genomic databases, such as the TIGR unigene database, Cucurbit genomics database, Sunflower and Lettuce, Medicago truncatula (International Medicago Genome Annotation Group), SGN, and Orange. In instances where species were more highly divergent, the alignment parameters such as Gap costs, matrix values can be appropriately changed.

To confirm the top hit to AtMYB68 in each species is in fact a MYB68 homologue, a reciprocal blast can be performed, in which the homologue is blasted against all Arabidopsis proteins. In many cases the homologue's closest Arabidopsis hit is to one of MYB68's gene family members (a MYB subgroup-14), instead of MYB68 itself. In cases where the homologue is closest to an Arabidopsis protein outside of the MYB68 gene family, the homologue is assessed not to be a MYB-subgroup14 member.

Open reading frames may be determined using programs such as “getorf” from the EMBOSS program, or ESTScan.

Methods for identification of MYB sequences, classification of MYB sequences into designated subgroups and identification of MYB-subgroup14 sequences are further described in Stracke et al., 2001 and Kranz et al., 1998.

Weakly conserved homologues between highly divergent species are often not found using traditional blast methods. In such cases, conserved motifs, domains and fingerprints will exist between homologues, and are good predictors of functional homology. Many programs exist that are proficient at finding conserved domains across species using hidden markov models, position-specific-scoring matrices, and patterns. PSI-Blast, PRATT, PHI-Blast, and HMMBuild/HMMSearch.

PRATT is a tool provided by the PROSITE database. It generates conserved patterns from a group of conserved proteins. PRATT was used to determine a conserved pattern between MYB68 and its closest homologues. ScanProsite and PHI-BLAST was then used to look for the conserved pattern in the Swiss-Prot and NCBI protein databases respectively. The search results are limited to alignments that also contain the pattern.

HMMBuild was used to build a hidden markov model using the MYB68 and its homologues. HMMSearch was used to scan NCBI's protein database using the hidden markov model. Similar proteins to the basic blastp search were found.

Utilizing the above methods, MYB68 homologues were found in over 50 different plant species. Homology was restricted in most cases to the N-terminal MYB DNA binding domain. Homology in the less conserved C-terminal region existed in genes found in Brassica napus (Canola), Brassica rapa (Cabbage), Brassica oleracea, Raphanus raphanistrum (Radish), Poncirus trifoliate (Orange), and weak homology in genes found in Medicago trunculata homologue and Vitis vinifera (Grape) within the S2 region.

The Poncirus trifoliate and Brassica rapa genes were identified by downloading strong EST hits, and assembling them using CAP3. The resulting contigs did not code for a complete protein. The contig sequence from Orange coded for a partial protein spanning only the S2 region, and in Brassica rapa, a partial 202 aa protein spanning from 1-202 in AtMYB68 was found.

A variety of programs have characterized the MYB domain with profiles, patterns, and hidden markov models. To confirm the presence of the MYB domain in a unknown sequence, a sequence can be searched against these profiles. InterProScan is particularly useful as it provides an interface to query 13 programs simultaneously. The databases incorporated in InterPro include

a) ProDom: a database of protein domain families. Built by clustering homologous segments from Swiss-Prot/Trembl database, followed by recursive PSI-BLAST searches.

b) HMMTIGR: Protein families represented by Hidden Markov Models.

c) TMHMM: Predicton of transmembrane helices in proteins.

d) FPrintScan: Searches the PRINTS database of fingerprints. Fingerprints are protein families represented by multiple motifs.

e) ProfileScan: Profiles from family related sequences.

f) HMMPanther: A database of hidden markov models

g) HMMPIR: Hidden markov models based on evolutionary relationship of whole proteins.

h) ScanRegExp: Scans the prosite database of patterns and profiles

i) Gene3D: a database of proteins containing functional information

j) HMMPfam: Protein domain families represented by hidden markov models.

k) Superfamily: a database of structural and functional protein annotations for all completely sequenced organisms.

l) HMMSmart: Allows the identification of mobile domains based on hidden markov models.

m) SignalIP: predicts the presence and location of signal peptide cleavage sites in amino acid sequences.

Blocks are multiply aligned ungapped segments corresponding to the most highly conserved regions of proteins. The MYB domain is represented by three blocks. As expected AtMYB68 contained each of these three blocks, as well as 1 of the 5 Wos2 blocks.

The invention is based in part on the discovery of plants that are heat stress tolerant. The gene responsible for the heat tolerant phenotype has been determined and shown to be a MYB68 gene. Methods of producing a heat tolerant transgenic plant are disclosed herein. Specifically the invention identifies a transcription factor gene family, specifically the MYB gene family, and in particular a MYB-subgroup14 that when expressed in plants results in plants that are heat stress tolerant and have an increased yield following a heat stress or display tolerance to drought stress or salt stress.

Example 10: Identification of MYB68 Homologues

Homologues from the same plant, different plant species or other organisms were identified using database sequence search tools, such as the Basic Local Alignment Search Tool (BLAST) (Altschul et al. (1990) J. Mol. Biol. 215:403-410; and Altschul et al. (1997) Nucl. Acid Res. 25: 3389-3402). The tblastn or blastn sequence analysis programs were employed using the BLOSUM-62 scoring matrix (Henikoff, S. and Henikoff, J. G. (1992) Proc. Natl. Acad. Sci. USA 89: 10915-10919). The output of a BLAST report provides a score that takes into account the alignment of similar or identical residues and any gaps needed in order to align the sequences. The scoring matrix assigns a score for aligning any possible pair of sequences. The P values reflect how many times one expects to see a score occur by chance. Higher scores are preferred and a low threshold P value threshold is preferred. These are the sequence identity criteria. The tblastn sequence analysis program was used to query a polypeptide sequence against six-way translations of sequences in a nucleotide database. Hits with a P value less than −25, preferably less than −70, and more preferably less than −100, were identified as homologous sequences (exemplary selected sequence criteria). The blastn sequence analysis program was used to query a nucleotide sequence against a nucleotide sequence database. In this case too, higher scores were preferred and a preferred threshold P value was less than −13, preferably less than −50, and more preferably less than −100.

Alternatively, a fragment of a sequence from SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 88, 90, 91, 92, 94, 96, 98, 100, 102, 104, 106, 108, 109, 110, 112, 114, 116, 118, 120, 122, 124, 125, 127, 129, 130, 132, 134, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 200, 202, 204, 206, 208, 210, 211, 213, 214, 216, 217, 218, 220, 222, 224, 226, 228, 230, 232, 234, 236, 238, 240, 242, 244, 246, 247, 249, 251, 253, 255, 257, 259, 261, 263 and 265 is ³²P-radiolabeled by random priming (Sambrook et al., (1989) Molecular Cloning. A Laboratory Manual, 2^(nd) Ed., Cold Spring Harbor Laboratory Press, New York) and used to screen a plant genomic library (the exemplary test polynucleotides) As an example, total plant DNA from Arabidopsis thaliana, Nicotiana tabacum, Lycopersicon pimpinellifolium, Prunus avium, Prunus cerasus, Cucumis sativus, or Oryza sativa are isolated according to Stockinger al (Stockinger, E. J., et al., (1996), J. Heredity, 87:214-218). Approximately 2 to 10 μs of each DNA sample are restriction digested, transferred to nylon membrane (Micron Separations, Westboro, Mass.) and hybridized. Hybridization conditions are: 42.degree. C. in 50% formamide, 5×SSC, 20 mM phosphate buffer 1×Denhardt's, 10% dextran sulfate, and 100 μg/ml herring sperm DNA. Four low stringency washes at RT in 2×SSC, 0.05% sodium sarcosyl and 0.02% sodium pyrophosphate are performed prior to high stringency washes at 55° C. in 0.2.times.SSC, 0.05% sodium sarcosyl and 0.01% sodium pyrophosphate. High stringency washes are performed until no counts are detected in the washout according to Walling et al. (Walling, L. L., et al., (1988) Nucl. Acids Res. 16:10477-10492).

Example 11: Identification of MYB68 Technical Features

A MYB68 gene can be identified by identifying genes that have high homology to an Arabidopsis MYB68 (SEQ ID NO:1). In addition to having homology of the nucleotide or amino acid sequence, one may identify candidate genes that share conformational protein structure. Such structural motifs may assist in identification of related proteins and their structure and function relationships.

Example 12: Functional Confirmation of Homologues

Candidate homologues are introduced into Arabidopsis and assessed for heat tolerance. Genes disclosed as SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 88, 90, 91, 92, 94, 96, 98, 100, 102, 104, 106, 108, 109, 110, 112, 114, 116, 118, 120, 122, 124, 125, 127, 129, 130, 132, 134, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 200, 202, 204, 206, 208, 210, 211, 213, 214, 216, 217, 218, 220, 222, 224, 226, 228, 230, 232, 234, 236, 238, 240, 242, 244, 246, 247, 249, 251, 253, 255, 257, 259, 261, 263 and 265 are expressed in Arabidopsis plants and heat tolerance assessed as described herein. Optionally, the expression of If a candidate MYB-subgroup14 sequence, matching the general pattern further includes a match to the exclusive pattern then the sequence is a strong candidate for inclusion as a MYB-subgroup14 genes or MYB68 genes can be evaluated in any transformable species, for example, Brassica, maize, cotton, soybean or rice. Examples of such functional testing have been provided in this disclosure.

It is noted that some of the sequences disclosed herein are not full length. Full length sequences can be obtained by standard molecular methods known to those of skill in the art. The full length sequences can then be expressed as described herein. 

What is claimed is:
 1. A method of producing a heat stress tolerant plant, comprising: a) providing a nucleic acid encoding a plant MYB subgroup-14 polypeptide, wherein said plant MYB subgroup-14 polypeptide is MYB68; b) inserting said nucleic acid into a vector; c) transforming a plant, a tissue culture, or a plant cell with said vector to obtain a transformed plant, tissue culture, or plant cell; d) increasing expression of said plant MYB subgroup-14 polypeptide in said transformed plant, tissue culture, or plant cell as compared to a wild type plant of the same species not transformed with said vector; e) growing said transformed plant or regenerating a plant from said transformed tissue culture or plant cell, wherein a heat stress tolerant plant is produced which has an increased heat stress tolerance as compared to a wild type plant of the same species not transformed with said vector; and f) selecting the heat stress tolerant plant by exposing the plant to heat stress conditions.
 2. The method according to claim 1, wherein said vector comprises a constitutive promoter or an inducible promoter. 